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Abstract.  Consider  a  matroid  where  each  element  has  a  real-valued  cost 


and  a  color,  red  or  green;  a  base  is  sought  that  contains  q  red  elements 
and  has  smallest  possible  cost.  An  algorithm  for  the  problem  on  general 
matroids  is  presented,  along  with  a  number  of  variations.  Its  efficiency 
is  demonstrated  by  implementations  on  specific  matroids.  In  all  cases 
but  one,  the  running  time  matches  the  best-known  algorithm  for  the  problem 
without  the  red  element  constraint. 

On  graphic  matroids,  a  smallest  spanning  tree  with  q  red  edges 
can  be  found  in  time  O(nlogn)  more  than  what  is  needed  to  find 
a  minimum  spanning  tree.  A  special  case  is  finding  a  smallest  spanning 
tree  with  a  degree  constraint;  here  the  time  is  only  0(m+n)  more  than 
that  needed  to  find  one  minimum  spanning  tree. 

On  transversal  and  matching  matroids,  the  time  is  the  same  as  the 
best-known  algorithms  for  a  minimum  cost  base.  This  also  holds  for 
transversal  matroids  for  convex  graphs,  which  model  a  scheduling  problem 
on  unit-length  jobs  with  release  times  and  deadlines. 

On  partition  matroids,  a  linear- time  algorithm  is  presented. 

Finally  an  algorithm  related  to  our  general  approach  finds  a  smallest 
spanning  tree  on  a  directed  graph,  where  the  given  root  has  a  degree 
constraint.  Again  the  time  matches  the  best-known  algorithm  for  the 
problem  without  the  red  element  (i.e.,  degree)  constraint. 

Key  Words,  matroid,  matroid  intersection  problem,  swap,  graph,  minimum 
cost  spanning  tree,  degree  constraint,  matching,  convex  bipartite  graph, 
job  schedule,  release  times  and  deadlines,  partition  matroid,  linear¬ 
time  selection,  minimum  cost  directed  spanning  tree. 


1.  Introduction. 


Matroids  offer  a  model  for  a  wide  variety  of  discrete  mathematical 
structures.  This  paper  investigates  a  combinational  problem  from  a  general 
matroid  point  of  view,  and  also  from  the  viewpoint  of  specific  matroids. 

To  state  the  problem,  consider  a  matroid  where  each  element  has  a  real¬ 
valued  cost  and  a  color,  red  or  green.  We  seek  a  base  of  the  matroid 
that  has  smallest  possible  cost  subject  to  the  constraint  that  it  contains 
exactly  q  red  elements,  for  a  given  q. 

Our  problem  can  be  viewed  as  a  matroid  intersection  problem.  To 
do  this  let  one  matroid  be  the  given  one  and  let  a  second  matroid  be  a 
partition  matroid  induced  by  the  coloring;  we  seek  a  minimum  cost  base 
of  the  two  matroids.  The  general  matroid  intersection  problem  is 
polynomial ly-bounded,  but  the  time  bound  is  of  high  degree  [L2,L1  pp.,300-355] 
Our  family  of  intersection  problems  can  be  solved  more  efficiently  because 
of  the  simple  structure  of  the  second  matroid. 

Section  2  reviews  some  relevant  notions  from  matroid  theory. 

Section  3  presents  an  algorithm  for  our  problem  on  general  matroids,  along 
with  several  variations.  Applications  to  specific  matroids  are  given  in 
the  following  sections. 

Section  4  investigates  graphic  matroids.  Here  the  problem  is  to 
find  a  smallest  spanning  tree  (or  forest)  that  contains  exactly  q  red 
edges.  Our  general  algorithm  runs  in  time  0(m  log  log^2+ni/n)n 
+  nlogn)(n  is  the  number  of  vertices,  m  the  number  of  edges). 

This  is  only  0(n  log  n)  time  more  than  the  time  to  solve  the 

problem  without  the  red  element  constraint.  Section  4  also  investigates 

the  problem  of  finding  a  smallest  spanning  tree  where  a  given  vertex  has 
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prespecified  degree.  This  problem  arises  in  the  design  of  certain 
communication  networks.  Previous  algorithms  have  been  given  [GK] ,  the  best 


using  time  0(m  loq  loq/0 . _  ,  xn  +  nloqn)  [Gall.  Thn  nrnhlam  ic 
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of  our  problem,  where  the  edges  incident  to  the  given  vertex  comprise  one 

color.  We  show  this  special  case  can  be  solved  in  time 
0(m  log  l°g(2+m/n)n) ;  more  Precise^y»  the  time  is  the  time  to  find  one 
minimum  spanning  tree  plus  0(m+n)  extra  processing.  (So  in  this  case  our 
problem  is  linear-time  equivalent  to  the  problem  without  the  red  element 
(i.e.,  degree)  constraint.) 

Section  5  investigates  matching  matroids  and  in  particular, 
transversal  matroids  for  convex  bipartite  graphs.  Our  problem  on  the  latter 

class  corresponds  to  the  following  scheduling  problem:  A  machine  executes  n 
unit-length  jobs,  chosen  from  a  set  of  m  jobs;  each  job  has  a  cost,  release  time 
deadline, and  job  class  (red  or  green);  find  a  smallest  cost  schedule  with 
exactly  q  jobs  of  the  red  class.  If  all  release  times  are  0,  we  give  an 
0(m  +  nlogn)  time  algorithm.  If  release  times  are  arbitrary,  we  give  an 
0(mlogn  +  n  )  algorithm.  In  a  general  matching  or  transversal  matroid,  we 
give  an  0(mlogm+ne)  time  algorithm  (here  m  is  the  number  of  vertices 
of  the  graph,  n  is  the  number  of  edges  in  a  maximum  matching  (thus  n  <  m), 
and  e  is  the  number  of  edges  in  the  graph.)  In  all  cases  our  time  bound 
equals  the  best-known  bound  for  finding  a  minimum  cost  base  of  the 
matroid  without  the  red  element  constraint. 


Section  6  investigates  partition  matroids.  Here  the  problem  is, 

given  a  set  with  a  partition,  where  each  element  has  a  cost  and  color; 

th 

find  a  smallest  subset  containing  exactly  n^  elements  from  the  i  block 
of  the  partition,  and  exactly  q  red  elements.  We  give  a  linear-time 
(and  hence  optimal)  algorithm  for  this  problem. 
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Section  7  discusses  the  problem  of  finding  a  smallest  directed 
spanning  tree  with  a  prespecified  root,  where  the  root  has  a  prespecified 
degree.  This  is  closely  related  to  our  matroid  intersection  problem. 
However,  si  nee  the  solution  is  a  base  of  three  matroids  rather  than  two, 
the  general  theory  does  not  apply.  We  present  an  0 (m  l°9(2+m/n)n) 
algorithm.  Again  this  bound  matches  the  best-known  algorithm  for  the 
problem  without  the  degree  constraint. 

For  the  algorithms  that  are  not  optimal  we  give  a  weak  type  of 
lower  bound:  We  show  that,  in  a  well-defined  sense,  any  algorithm  using 
an  approach  similar  to  ours  has  a  nonlinear  lower  bound.  In  some  cases 
this  bound  matches  our  upper  bound. 

All  of  our  algorithms  easily  generalize  to  the  problem  where  the 
desired  base  contains  at  most  (or  at  least)  q  red  elements.  Thus  this  paper 
gives  evidence  that  a  "q  red  elements"  constraint  can  often  be  handled 
efficiently. 
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2.  Matroid  Preliminaries. 

This  section  reviews  some  basic  facts  about  matroids.  It  is 
assumed  that  the  reader  is  familiar  with  an  introductory  treatment  of 
matroids,  such  as  [LI ,Ch .7]  or  [W,Ch.l].  Our  terminology  comes  mainly 
from  the  former.  Definitions  of  fundamental  concepts  such  as  independent 
set,  base,  and  circuit  can  be  found  in  these  sources. 

We  use  a  convenient  shorthand  notation  for  set  operations:  If 
S  is  a  set  and  e  an  element,  then  S  +  e  is  the  set  S  u  {e},  and  $  -  e 
is  S  -  { e } .  We  sometimes  use  +  instead  of  u,  as  in  B  -  G  +  R  (for  sets 
B,  G,  R).  When  parentheses  are  omitted,  operators  are  associated  to  the 
left,  e.g.,  S  +  e  -  f  is  (S+e)  -  f. 

We  use  graphic  matroids  to  illustrate  the  discussion  on  general 
matroids.  A  graphic  matroid  derives  from  a  graph  or  multigraph.*  The 
elements  of  the  matroid  are  the  edges  of  the  graph.  The  independent  sets 
are  the  forests,  and  the  bases  are  the  spanning  forests  (or  spanning  trees, 
if  the  graph  is  connected).  Figure  2.1  shows  a  graph  (with  solid  and 
dotted  edges);  Figure  2.2  gives  several  bases.  (We  will  give  a  more 
precise  description  of  these  figures  shortly.) 

Our  results  follow  from  one  simple  property  of  matroids, which  can 
be  included  in  a  set  of  matroid  axioms: 

Symmetric  Swap  Axiom  [B,  Wp.  15].  If  B^  and  are  bases  and  element 
f  e  B^,  then  there  is  an  element  e  e  B^  such  that  both  B^  -  f  +  e  and 

B^  -  e  +  f  are  bases. 

The  Symmetric  Swap  Axiom  is  a  strong  formulation  of  the  more  standard 
base  axiom  of  a  matroid  [LI,  p.  274].  It  can  be  derived 

^Throughout  this  paper  graphs  are  undirected  unless  explicitly  specified 
otherwise.  Also  when  our  algorithms  work  on  multigraphs  we  say  so  and  then 
proceed  to  give  the  discussion  in  terms  of  graphs. 
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from  the  independence  axiom  [L Lp. 268]  with  1 ittl e  difficulty.  In  Figure 
2.2  bases  and  have  the  symmetric  swap  of  edges  2,5. 

Another  important  notion  is  the  fundamental  circuit  C(e,B)  for 
an  element  e  and  a  base  B,  where  e  i  B.  This  is  the  unique  circuit  in 
B  +  e.  C(e,B)  always  exists  for  e  i  B,  and  e  e  C(e,B)  [L,p270]. 

If  S  is  a  set  of  elements,  del eti ng  S  from  M  gives  the  matroid 
M-S.  Its  elements  are  those  of  M,  excluding  S.  Its  independent  sets 
are  the  independent  sets  of  M  that  do  not  intersect  S.  If  S  is  an 
independent  set  of  elements  of  M,  contracting  S  gives  the  matroid  M/S. 

Its  elements  are  also  those  of  M,  excluding  S.  Its  independent  sets 
(respectively,  bases)  are  the  sets  I  of  elements  of  M/S  such  that  I  u  S 
is  an  independent  set  (base)  of  M.  (We  restrict  ourselves  to  contracting 

independent  sets  for  convenience  only)  . 

In  this  paper  we  investigate  the  following  problem.  Given  is  a 

matroid  M.  Each  element  e  has  a  real-valued  cost  c( e) .  Each  element 
is  colored  either  red  or  green.  We  seek  a  minimum  cost  base  that 
contains  exactly  q  red  elements,  for  a  given  value  q. 

In  the  graphic  matroid  of  Figure  2.1,  the  red  edges  are  solid  and 
the  green  edges  are  dotted.  Each  edge  is  labelled  by  its  cost.  For 
convenience  we  identify  an  edge  by  its  cost  (e.g.,  edge  2  is  the  edge  of 
cost  2).  These  conventions  are  used  in  all  figures  of  the  paper. 

Figure  2.2  shows  bases  B.. ,  i  =0,  ...,  4,  where  B..  is  the  smallest  cost 

base  with  exactly  i  red  edges. 
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We  make  the  following  conventions,  m  denotes  the  number  of  elements 
of  M,  and  n^  denotes  the  number  of  elements  in  a  base  (i.e.,  the  rank 
of  the  matroid).*  For  convenience  we  often  omit  explicit  references  to 
element  costs.  For  instance  we  say  an  element  e  is  a  smallest  element  of 
a  set  S  if  it  has  the  smallest  cost  c(e);  similarly  we  refer  to  largest 
element,  smallest  base,  etc.  Finally,  3  denotes  the  set  of  all  bases 
with  exactly  i  red  elements  and  smallest  cost  possible.  Our  problem 
is  to  find  a  base  in  3  .  Note  that  there  are  integers  l  and  u  such  that 
3 .  is  nonempty  exactly  when  l  <  i  <  u.  This  is  easy  to  prove  using  the 
independence  axiom  for  matroids.  It  also  follows  from  Theorem  3.1  below. 

Our  problem  can  be  viewed  as  a  matroid  intersection  problem.  Define 
a  matroid  M1  on  the  given  elements,  where  a  base  of  M1  contains  exactly 
q  red  elements  and  n-q  green  elements.  M'  is  a  partition  matroid  (see 

Section  7  for  the  general  definition).  Thus  our  problem  is  to  find  a 
smallest  set  that  is  a  base  in  both  M  and  m',  or  less  precisely,  a  smallest 

intersection  of  M  and  a  (special)  partition  matroid. 


*  Note  that  in  a  graphic  matroid  this  convention  differs  from  the  usual 
one  for  graphs.  We  take  n  as  the  number  of  edges  in  a  spanning  forest, 
whereas  it  is  usually  the  number  of  vertices  in  the  graph.  The  first 
quantity  is  always  less  than  the  second,  and  they  differ  by  one  for 
connected  graphs.  We  only  use  n  in  asymptotic  estimates,  and  as  will 
be  seen,  these  two  properties  allow  n  to  be  interpreted  either  way  in 
these  estimates. 


3.  The  General  Algorithm. 


This  section  presents  an  algorithm  for  our  intersection  problem  on 
arbitrary  matroids;  useful  variations  are  also  given.  The  efficiency  is 
illustrated  for  the  case  of  graphic  matroids.  Subsequent  sections  show  that 
the  algorithm  and  its  variants  are  efficient  on  other  matroids. 

The  results  of  this  section  were  discovered  independently  by 
Dan  Gusfield  [Gu] .  Gusfield  investigates  the  problem  of 
uniformly  modifying  the  costs  of  red  elements.  His  derivation  is 
concise  and  elegant, and  contains  all  the  results  of  this  section  (either 
explicitly  or  implicitly).  Here  we  give  our  own  development  (different 
from  Gusfield' s)  so  the  paper  is  self-contained.  (Also,  some  ideas  of 
this  section  for  the  special  case  of  graphic  matroids  appear  in 
[Gal],  [U].). 

The  general  matroid  intersection  problem  can  be  solved  by  augmenting 
paths  [LI, pp. 326-48] .  In  our  problem  these  paths  are  particularly  simple, 
as  we  now  show. 

Definition  3.1.  A  swap  for  a  base  B  is  an  ordered  pair  of  elements  (e,f), 
where  e  e  B  is  green,  f  £  B  is  red  and  B  -  e  +  f  is  a  base.  The  cost  of 
(e,f )  is  c ( f )  -  c(e). 

We  say  that  (e,f)  is  a  swap  for  f,  for  f  and  B,  etc.  Following  a 
previous  convention,  we  say  (e,f)  is  a  smallest  (largest)  swap  if  its 
cost  is  as  small  (large)  as  possible. 

Figure  2.2  shows  four  swaps  executed  serially  on  the  matroid  of 
Figure  2.1.  A  base  B.  e  $_•  derives  from  B.  ,  e  B*  ,  by  a  swap.  The 

#  I  !  I  ""  X  I  “  L 

following  result  shows  this  is  true  in  general. 

Theorem  3.1  (Augmentation  Theorem).  Suppose  B  is  a  base  in  3^^  and 


3^  *  0.  If  (e,f)  is  a  smallest  swap  for  B,  then  B  -  e  +  f  e  3. . 

Proof.  It  suffices  to  find  a  swap  (e,f)  such  that  B  -  e  +  f  e  3..  For 
this  implies  that  (e,f)  is  a  smallest  swap  for  B,  and  further,  any  smallest 
swap  for  B  gives  a  base  in  3^.  (Also  it  implies  that  the  swap  of  the 
Theorem  always  exists). 

Choose  base  B1  e  3.  such  that  | B  n  B '  [  is  maximum.  Let  f  be  a  red 
element  in  b'  -B  (f  exists  since  b'  has  more  red  elements  than  B.)  By 
the  Symmetric  Swap  Axiom,  there  is  an  element  e  e  B  such  that  B-e  +  f 
and  b'  -f  +  e  are  bases.  Clearly  e  *  f. 

We  show  e  is  green  by  contradiction..  If  e  is  red  then  B1  -  f  +  e 
is  a  base  with  i  red  elements.  Thus  c(B'-f  +  e)>c(B'),  so  c(e) >  c(f) . 
Similarly  examining  base  B-e  +  f  shows  c(f)  >  c(e).  We  conclude 
c(e)  =  c ( f ) .  Thus  B'  -  f  +  e  is  a  base  in  3.  having  more  elements  in  common 
with  B  than  B ' .  This  is  the  desired  contradiction. 

Now  since  e  is  green,  B-e  +  f  is  a  base  with  i  red  elements,  and 

B1  -  f  +  e  is  a  base  with  i  -  1  red  elements.  The  latter  implies 

c(B‘  -  f  +  e)  >  c(B),  or  equivalently  c(B')  >  c(B-e  +  f).  This  inequality 

shows  B-e  +  f  e  3. ,  as  desired.  □ 

1 


Note  that  this  proof  is  easily  modified  to  show  that  3.  *  0  exactly 
when  l  <  i  <  u,  for  two  integers  l  and  u  (see  Section  2).  (If  3r# 

3$  *  0  for  r<s,  show  3r+1  *  0  by  starting  with  B  e  3f  and  choosing 
B  £  3s  as  in  the  proof). 


The  Augmentation  Theorem  implies  an  algorithm  for  our  problem: 
Start  with  a  base  In  3^.  In  general,  having  derived  a  base  in  3.,  find 
a  smallest  swap  and  derive  a  base  in  3.+^.  Repeat  this  procedure  until 


a  base  in  3  is  derived, 
q 
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We  improve  the  efficiency  of  this  approach  in  several  ways.  First 
we  show  that  the  elements  involved  in  swaps  can  be  drawn  from  a  restricted 
set.  Recall  u  is  the  largest  index  with  3, ,  *  0. 

I 

Corollary  3.1.  Let  B  be  a  base  in  and  B  a  base  in  $  .  Then  there 
is  a  smallest  swap  (e,f)  for  B,  with  e  e  B  -  B '  and  f  e  B  -B. 

Proof.  Let  (g,h)  be  a  smallest  swap  for  B.  We  first  find  a  smallest 
swap  (g,f)  with  f  e  B1  -B.  Then  we  find  the  desired  swap  (e,f). 

The  Symmetric  Swap  Axiom  applied  to  h,  B-g  +  h  and  B1  shows  there 
is  an  element  f  e  B1  such  that  B  -  g  +  f  and  B'  -f  +  h  are  bases,  f  is  red. 

For  otherwise  since  h  is  red,  B1  -f+h  is  a  base  with  u  +  1  red  elements, 
which  is  impossible. 

Since  f  is  red,  f  *  g.  Since  B-g  +  f  is  a  base,  we  have 
f  e  B '  -  B  and  (g,f)  is  a  swap  for  B.  For  (g,f)  to  be  a  smallest  swap 
we  must  have  c(h)  >  c(f).  To  see  this,  note  B  -f+h  has  u  red  elements, 
so  c(B'-f+h)  >  c(B'),  and  c(h)  >  c(f) .  Thus  (g,f)  is  as  desired. 

If  g  i  B 1  ,  then  take  e  =  g,  and  swap  (e,f)  gives  the  Corollary. 
Otherwise  find  e  by  applying  the  Symmetric  Swap  Axiom  to  g,  B-g  +  f,  and 
b'.  The  argument  is  analogous  to  the  one  above  and  is  left  to  the  reader.  □ 

Another  useful  fact  is  that  swaps  get  progress!' vely  more  expensive. 
Corollary  3.2  is  a  formulation  of  this  fact  that  is  used  in  Section  4.2; 
Corollary  3.3  is  another  formulation. 

Corollary  3.2.  Let  B  be  a  base  containing  a  green  element  e.  Let  (e,f)  be 
a  smallest  swap  involving  e,  and  set  B‘  =  B-e  +  f.  For  any  green  element 
g  e  B-e,  let  (g,h),  (g,h')  be  smallest  swaps  for  g  and  bases  B,  B  , 
respecti vely.  Then  c(g,h)  <  clg^'). 
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Proof.  Clearly  we  may  assume  ( g , h 1 )  is  not  a  valid  swap  for  B.  Now  apply 

i 

the  Symmetric  Swap  Axiom  to  g,  B  and  B-e  +  f-g  +  h  ,  to  show  that 

B-g  +  f  and  B-e  +  h'  are  bases.  The  former  shows  (g,f)  is  a  swap  (for  B), 

whence  c(f)  >  c(h).  The  latter  shows  ( e 9 h ' )  is  a  swap,  whence  c(h  )  >  c(f). 

Thus  c(h 1 )  >  c(h) ,  as  desired.  □ 

Corollary  3.3.  The  cost  of  a  base  in  $.  is  a  convex  function  of  f. 

Proof.  We  need  only  show  that  the  cost  of  an  optimum  swap  (e.  ,f. )  is 
nondecreasing  with  i.  This  follows  from  the  previous  result.  □ 

This  Corollary  shows  how  to  solve  a  modified  version  of  our  problem, 
where  the  desired  base  is  the  smallest  one  with  at  least  (or  at  most)  q  red 
elements.  To  do  this  first  find  a  minimum  cost  base.  If  it  satisfies 
the  red  element  constraint,  it  is  the  desired  base.  Otherwise  the  desired 
base  is  the  smallest  one  with  exactly  q  red  elements. 

The  time  for  this  procedure  is  the  time  to  find  a  minimum  base  plus 
the  time  to  solve  the  unmodified  problem.  The  latter  always  dominates. 

Thus  the  time  estimates  given  in  Sections  4-7  also  hold  for  the  modified 
problem. 

Returning  to  Corollary  3.1,  we  can  find  the  desired  base  B^  as 
follows.  First  find  B^  and  B  ,  bases  in  3^  and  3U ,  respectively.  Then 
repeatedly  swap  a  green  element  of  B^  for  a  red  element  of  B  ,  until 
B  is  derived. 

q 

In  this  approach  each  swap  must  have  the  smallest  cost  possible. 

The  bulk  of  the  time  is  spent  searching  for  these  smallest  swaps.  Searching 
is  complicated  by  the  fact  that  each  time  a  swap  is  executed,  a  new  base 
is  derived.  This  changes  the  set  of  valid  swaps,  and  necessitates  new 
searching.  To  cut  down  on  the  searching  we  derive  an  alternate 
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characterization  of  the  swaps  involved.  This  allows  us  to  find  the  swaps 
efficiently,  although  in  a  different  order. 

We  call  the  desired  sequence  of  swaps  the  "swap  sequence."  Actually 
it  is  convenient  to  use  this  term  in  a  slightly  more  general  context. 

Definition  3.2.  Let  B  be  a  base  and  R  an  independent  set  of  red  elements. 
Let  Bq  =  B.  Suppose  for  i  =  1,  ...,  r,  B.  =  B..^  -  e.  +  f.. ,  where  ( e^. ,  f  _. ) 
is  a  smallest  swap  for  B.  ^  that  has  f..  e  R;  further,  no  swap  for 
and  an  element  of  R  exists.  Then  (e. ,f. ),  i  =  l»  . .  .,r,  is  a  swap  sequence 
(for  B  and  R). 

When  B  e  ^  and  R  is  the  set  of  red  elements  in  a  base  of  3y,  then 
the  swap  sequence  is  the  one  we  seek.  Figure  2.2  shows  a  swap  sequence 
for  the  example  matroid. 

The  following  idea  is  the  key  to  our  characterization  of  the  swap 
sequence.  In  Figure  2.1-2,  consider  3,  the  smallest  red  element.  The 
smallest  swap  for  3  and  BQ  is  (2,3).  Although  this  is  not  the  first 
swap  of  the  swap  sequence  (as  one  might  guess),  it  is_  in  the  swap  sequence. 
It  is  not  hard  to  see  why:  a  green  element  in  the  circuit  C(3,BQ) 
cannot  give  a  better  swap  than  (2,3);  hence  C(3,BQ)  is  preserved  until 
swap  (2,3)  is  made. 

To  state  the  result  precisely,  let  h  be  a  smallest  element  of  R. 

Let  (g,h)  be  a  smallest  swap  for  h  and  B.  In  the  matroid  M-g/h,  take 
a  swap  sequence  for  base  B-g  and  red  elements  R-h.  Insert  (g,h)  as 
the  j  + 1st/  swap,  where  the  swap  sequence  begins  with  j  swaps  strictly 
smaller  than  (g,h).  Call  the  resulting  sequence  S. 

Lemma  3.1.  S  is  a  swap  sequence  for  B  and  R. 

Proof.  First  note  that  S  is  well-defined,  i.e.,  in  matroid  M-g/h,  B-g 
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and  R-h  are  sets  that  have  a  swap  sequence:  B-g  is  a  base  of  M-g/h, 
since  B-g  +  h  is  a  base  of  M;  similarly  R-h  is  independent. 

Now  let  S  be  the  sequence  (e^f. ),  i  =  l,  ...,r.  (So  for  i  =  j  +  1, 

( e.  ,f  i )  =  (g,h) . )  Let  BQ  =  B ,  and  for  i  =  1,  ...»  r,  let  B.  =B.-1-e1  +  f  1 . 

We  must  show  that  (e.  ,f . )  is  a  smallest  swap  for  B _  p  for  i  =  1 ,  . . . ,  t. 

The  argument  divides  into  three  cases:  the  first j  swaps,  the  j  +  1st 
swap  (g,h),and  the  remaining  swaps. 

Consider  the  first  j  swaps.  We  show  that  for  i  =  1 ,  . . . ,  j ,  if 
B.  j  is  a  base  and  circuit  C(h,B._^)  =  C(h,B),  then  (e.. ,f. )  is  a  smallest 
swap  for  B.  p  and  C( h , B ^ )  =  C(h,B).  Clearly  this  implies  the  desired 
conclusion,  by  induction  on  i. 

Base  B.  ^  is  the  result  of  executing  swaps  (epf^),  (e_j_pf.jp) 
on  base  B,  in  matroid  M.  In  matroid  M-g/h,  executing  these  same  swaps 
on  base  B  -  g  gives  B.  ^-g.  Now  we  show  a  useful  proposition: 

For  elements  e,f/{g,h};,  suppose  (e,f)  costs  less  than  (g,h).  Then 
(e,f )  is  a  swap  for  Bp1  (in  M)  if  and  only  if  it  is  a  swap  for  B.p-g 
(in  M-g/h) . 

Notice  that,  in  our  induction  on  i ,  i  goes  from  1  to  j.  However  in 
this  proposition,  we  allow  i  =  j  +  1.  The  proof  given  below  still  applies, 
and  the  proposition  is  useful  in  the  next  case,  the  j  +  1st  swap. 

The  proposition  is  equivalent  to  showing  that  in  M,  B.p-e  +  f  is 

a  base  if  and  only  IfB^-g-e  +  f+h  is  a  base.  Since  (e,f )  costs  less 

than  (g,h),  and  c(f)  >  c(h),  it  follows  that  c(e)  >  c(g).  Thus,  by  the 

definition  of  g,  e  £  C(h,B)  ='  C(h,Bp^).  Now  suppose  B._p  e  +  f  is  a 

base,  call  it  A.  Then  A  +  h  contains  the  circuit  C(h,B)  ,  and 

A-q  +  h  =  B.  -q-e+f+h  is  a  base,  as  desired.  Conversely,  suppose 
3  i-l 

B.  ,  -q-e  +  f  +  h  is  a  base,  call  it  A1.  Then  A 1  +  g  contains  C(h,B), 
i-l  3 
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artdA'-h  +  g  =  B.  ^-e  +  f  is  a  base,  as  desired.  This  proves  the 
proposi tion. 

Now  (e.,f.)  is  a  swap  for  B.  1  -g(in  M-g/h),  costing  strictly  less 

ii  i  -  j. 

than  (g,h).  The  proposition  shows  (e. ,f. )  is  a  swap  for  B.  y  Also,  from 

the  proof  of  the  proposition,  e.  i  C(h,B._^),  and  so  C(h,B_.)  =  C(h,B^_^)  = 

C(h,B).  It  remains  to  show  that  (e. ,f. )  is  a  smallest  swap  for  B._^. 

Suppose,  on  the  contrary,  that  a  swap  (e,f)  costs  less.  If  e,  f  £  (g , h } , 

then  the  proposition  shows  (e,f)  is  a  swap  for  B.  ^-g.  But  this  contradicts 

the  definition  of  (e.,f  .).  Thus  e  =  g  or  f  =  h.  Since  C( h , B ^ ^ )  =  C(h,B), 

the  smallest  swap  involving  g  or  h  is  (g,h).  (Recall  h  has  smallest  cost). 

But  (g,h)  costs  more  than  (e.  ,f.. ).  These  contradictions  show  ( e^. , f ^ )  is 

a  smallest  swap  for  B^  y  This  completes  the  analysis  of  the  first  j  swaps. 

For  the  j  +  1st  swap,  we  must  show  that  (g,h)  is  a  smallest  swap 

for  B..  From  the  induction  made  for  the  first  j  swaps,  C(h,B.)  =  C(h,B). 

J  J 

Thus  (g,h)  is  a  swap  for  B..  The  proposition  of  that  induction  (valid  for 

i=j+l)  shows  that  if  there  is  a  smaller  swap  for  B.  than  (g,h),  there  is 

also  a  smaller  one  for  B.-g  (in  M-g/h).  The  latter  is  false  by  supposition. 

J 

Hence  (g,h)  is  a  smallest  swap  for  B.,  as  desired. 

J 

Finally  consider  the  remaining  swaps.  We  show  by  induction  that 
for  i  =  j  +  2,  ...,r,  (e.  ,f. )  is  a  smallest  swap  for  B.  y  Base  B._^  is 
the  result  of  executing  swaps  (e^,f^),  ...,  (e-j  0nB.  Iri  matroid  M-  g/h , 

executing  these  same  swaps,  except  for  (ej+^,f.+^)  =  (g,h),  on  B-g, 
gives  B.  ^  -  h.  It  is  easy  to  see  that  (e,f)  is  a  swap  for  B^  ^  (in  M) 
if  and  only  if  it  is  a  swap  for  B..  ^-h  (in  M-g/h).  Thus  (e  .  ,f..)  is 
a  smallest  swap  for  B^_p  as  desired.  □ 

The  Lemma  can  be  used  iteratively  to  find  a  complete  swap  sequence. 

The  following  definition  is  useful. 
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Definition  3.3.  Let  B  be  a  base  and  let  R  be  a  set  of  red  elements  such 

that  R  plus  the  red  elements  of  B  form  an  independent  set.  Order  the 

elements  of  R  as  h.,  i  =  1,  r,  so  that  the  cost  is  nondecreasing.  Let 
1 

Hq  =  B.  Suppose  g .  and  FL ,  i  =  1 ,  . . . ,  r ,  are  such  that  =  FU  -  g..  +  h.. 
and  (g..,h.)  is  a  smallest  swap  for  h.  and  FK^.  Then  (g..  ,h.. ),  i  =  1, 

. ..,  r,  is  a  restricted  swap  sequence  (for  B  and  R). 

The  term  "restricted  swap  sequence"  derives  from  the  fact  that  we 
have  restricted  the  order  in  which  the  red  elements  get  swapped  into  the 
base.  Also  note  that  the  initial  condition  given  on  R  is  for  notational 
convenience  only.  One  consequence  is  that  |R|  becomes  the  length  of  a 
swap  sequence.  Figure  3.1  shows  a  restricted  sequence. 

Note  that  a  restricted  swap  sequence  for  B  and  R  always  exists, 
i.e.,  for  each  element  h.,  there  is  a  swap  (g_.,h,. ):  The  circuit 
C( h^ ,H  contains  a  green  element,  since  the  red  elements  of  B  and  R 

are  independent.  Now  g.  exists  as  a  largest  green  element  of  C(hi,Hi_1). 
In  Figure  3.1,  for  i  =1,  2,  3,  H.  is  not  a  base  in  the  optimum  set 
.  Fiowever  a  restricted  sequence  does  give  the  desired  swaps: 

Corollary  3.4.  A  restricted  swap  sequence  for  B  and  R  can  be  rearranged 
to  form  a  swap  sequence  for  B  and  R. 

Proof.  The  proof  is  by  induction  on  r.  The  base  case  r  =  0  is  vacuous. 
For  r  >  0,  Lemma  3.1  shows  (g^h^  is  in  a  swap  sequence,  where  the 
remaining  swaps  form  a  swap  sequence  for  B  -  g^  and  R  -  h^  in  matroid 
M  -  g  i  /h  ^ .  It  is  easy  to  check  that  (g..  ,h_. ),  i  =  2,...,r  is  a  restricted 
swap  sequence  for  B  -  g1  and  R-h^  By  induction  these  swaps  rearrange 
to  a  swap  sequence.  The  desired  conclusion  follows.  □ 
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To  actually  form  the  swap  sequence,  order  swaps  (g.,h  )  of  a 

1  i 

restricted  sequence  as  follows:  Sort  (g.  ,h.),  i  =  1  *  . . . ,  r  so 

that  their  cost  is  nondecreasing  and  for  swaps  (gn.  ,hn. )  of  equal  cost, 

the  index  i  is  increasing. 

Corollary  3.5.  A  restricted  swap  sequence,  ordered  as  above,  is  a 

swap  sequence. 

Proof.  Argue  as  in  Corollary  3.4.  □ 


For  some  matroids  Corollaries  3.4  and  3.5  give  the  best  way  to 
solve  our  problem.  For  other  matroids  a  di vi de-and-conquer  approach  can 
be  more  efficient.  We  modify  the  results  for  this  approach  as  follows: 
Choose  B  and  R  as  in  Definition  3.3.  Let  contain  the  )  -^-J  smallest 

elements  of  R,  R^  =  { h .  | i  =1,  . . . ,  |JJJ} *  Let  R^  contain  the  remaining 
red  elements ,  R^  =  {hm  |  i  =  jJJJ  +  1 ,  . . . ,  r}.  Let  Gj  be  a  set  of  J  ^  I  green 


red  elements,  R^  -  { h ^  |  i  J  +  1 ,  . . . ,  r} .  Let  be  a  set  of  J^^-J  green 

elements  such  that  B  -  G^  +  is  a  smallest  base  whose  red  elements  are 
exactly  the  red  elements  of  B  u  R^ .  Let  G^  be  a  subset  of  the  remaining 
green  elements  of  B  such  that  B  -  (Gn  u  G^)  +  R  is  a  smallest  base  whose  red 
elements  are  exactly  the  red  elements  of  B  u  R. 

Intuitively  we  expect  that  a  restricted  sequence  swaps  elements 


of  Gj  with  elements  of  R^,  and  similarly  for  G^  and  R^.  This  is  correct 
except  for  slight  complications  due  to  equal-cost  elements.  So  in  the 
matroid  M-R^/G^,  let  be  a  restricted  swap  sequence  for  base  B  - 
and  red  elements  R^ .  Let  G^  =  (g|(g,h)  is  in  S^}.  (As  indicated  above, 
it  is  not  necessarily  true  that  G1  =  G^ ' ) .  In  M-G^'/R^,  let  be  a 
restricted  swap  sequence  for  B  -  G^‘  and  R,,.  Let  S1  S2  be  the  sequence 
formed  by  concatenating  onto  the  end  of  S^. 
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Corollary  3.6.  is  a  restricted  swap  sequence  and  thus  can  be  ordered 

to  form  a  swap  sequence  for  B  and  R. 

Proof.  We  start  by  checking  that  the  construction  is  well-defined.  First 
note  the  set  G^  exists:  There  is  some  set  of  green  elements  G  with 
B  -  G  +  a  base  ( e .  g . ,  (g 
G1  is  a  smallest  set  G.  Similarly  we  see  G2  exists.  To  check  that  the 
sequence  exists,  note  that  in  M-R^/G^,  B  -  G^  is  a  base.  Further,  R^ 
plus  the  red  elements  of  B  -  G2  is  independent,  since  B  -  G1  +  is  a  base 
of  M  containing  G^.  To  check  that  exists,  note  that  in  M-G',/Rp  B-G'^  is 
a  base,  since  B-G^  +  R1  is  a  base  of  M.  Further,  R2  plus  the  red 
elements  of  B  -  G^'  is  independent,  si  nee  B  -  (G^  u  G2)  +  R  is  a  base  of  M 
containing  R^  and  these  red  elements. 

We  prove  the  Corollary  by  induction  on  r.  The  case  r  =  0  is  vacuous. 


L 

9  *  *  *  ’  U  J 


}  in  a  restricted  sequence). 


Assume  r  >  0.  Let  S1S0  =  (g . ,h . ) ,  i  =  1,  . . . ,  r.  In  M,  define  H. ,  i  =  1 , 

1  L  1  1  1 

. . .,  r  by  H.  =  H.  -  -  g.  +h..  Now  we  must  show  that  (g.,h.)  is  a  smallest 
J  i  i-l  3i  i  1  i 

swap  for  h  .  and  H . _ ^ ,  i  =  1 ,  . . . ,  r. 

First  consider  the  case  i  =  1,  ...»  jjr. j  .  In  matroid  M  -  R2/G2, 

swaps  (gr  hx),  ...,  (g^h.)  derive  the  base  Hi  -  G£.  Thus  H.  is  a  base  of  M, 

and  (g  .  ,h. )  is  a  swap  for  H ^ ^ . 

Now  let  (g,h.)  be  a  smallest  swap  for  h^  and  H._p  We  show  (g^  ,h. ) 
costs  no  more,  so  it  too  is  a  smallest  swap,  as  desired.  First  suppose 


g  i  G2.  Then  ( g , h ^ )  is  a  swap  for  _1-  G2  in  M-R2/G2.  Now  the  definition 
of  (g.9h.)  implies  it  costs  no  more  than  (g,h. ). 

On  the  other  hand,  suppose  g  e  G2 .  Consider  (in  matroid  M)  bases  S^  =  B  -  G^  +  R^ 
B2  =  H^^-g  +  h^.  Note  g  e  Bp  The  Symmetric  Swap  Axiom  applied  to 
g,  B^, and  B2  shows  there  is  an  element  g'  e  B2  such  that  B^-g  +  g'  and 
B 2  “  9 '  +  9  one  bases.  Note  that  g '  e  Gp  since  the  only  elements  of  B2  that 
are  not  in  B^  -  g  are  in  Gp  This  implies,  first  of  all,  that 
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i  , 

c(g  )  ^  c(g),  since  -  g  +  g  is  a  base  (recall  the  definition  of  B^). 

It  also  implies  c(g^)  ^  c(g'),  since  B^-g'  +g  is  a  base  containing 
(recall  the  definition  of  g_.).  These  inequalities  imply  c(g_.)  >  c(g).  Thus 
(gi ,h.)  costs  no  more  than  ( g ,  h ^ ),  as  desired. 


Now  consider  the  case  i 


•  _  r 

1  ? 

mf-m- 


+  1  5 


In 


G  ^  /  R  ^ 


swaps 


(g  ,  h  (g.  19h.  ,)  derive  the  base  H.  1  -  R-, .  (Here  it  is 

hd+1  |£|+1 

i 

necessary  that  we  use  matroid  M  -  /Rp  rather  than  M-G^/R^.)  It  is 
easy  to  see  (g,h^)  is  a  swap  for  H._^  (in  M)  if  and  only  if  it  is  a 
swap  for  H*_^-R^  (in  M-G^'/R^)-  This  gives  the  desired  result.  □ 


The  di vide-and-conquer  approach  derives  its  efficiency  from  doing 
the  computation  on  smaller  and  smaller  matroids.  Toward  this  end  the 
following  facts  are  useful. 

Lemma  3.2.  (a)  Choose  bases  B^  e  $£  and  Bu  e  3U-  Let  matroid 

M1  =  M  -  (BTlTB  )/  (Bn  n  B  ).  Then  a  swap  sequence  for  B„  -  B  and  the 

-L  U  -C  U  -C  U 

red  elements  of  B^  -  B^,  in  M1,  is  a  swap  sequence  for  B^  and  the  red 
elements  of  B  ,  in  M. 

(b)  Choose  B^  e  3^.  Then  there  is  a  base  B^  e  3y  such  that 

in  M1  (defined  as  in  (a)),  B^-Bu  is  a  base  of  all  green  elements  and 

B  -Bn  is  a  base  of  all  red  elements, 
u  t 

Proof,  (a)  Clearly  B^-B,.  and  the  red  elements  of  B(i-Bp  have  a  swap 
sequence  in  M1;  let  it  be  ( e  -  »f . ) ,  i  =  1,  ...»  r.  In  M,  let  the  sets 
derived  from  these  swaps  be  A.,  i.e.,  Aq  =  B^  and  for  i  =  1,  ...,  r, 

A^.  =  A..  ^  -  e..  +f-j.  The  first  i  swaps,  which  derive  A..  in  M,  derive 
Ai-(B£nBu)  in  M1  .  Thus  A.  is  a  base  of  M,  and  (e.,fi)  is  a  swap  for  Ai_r 
To  see  that  (e.,f.)  is  a  smallest  swap,  Corol  1  ary  3.1  shows  there  is  a  smallest 
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swap  (e,f )  with  e  e  A.  -  -  B  and  f  £  B  -  A.  , .  Since  e  is  green, 

I x  u  u  i  ““  x 

e  £  -  Bu;  similarly  f  e  B^  -  B^.  Thus  (e,f)  is  a  swap  for  A..^  -  (B^  n  Bj 

in  M  .  This  implies  that  (e^ ,f. )  costs  no  more  than  (e,f ) ,  as  desired. 

(b)  and  Bu  satisfy  the  desired  condition  if  B^  n  Bacons is ts  of  the 

green  elements  of  Bu  and  the  red  elements  of  B^.  To  find  an  appropriate 
B^  for  B^,  apply  the  Augmentation  Theorem  to  B^  until  a  base  of  3U  is 
derived.  □ 

Now  we  give  the  di vide-and-conquer  algorithm.  The  procedure  A(M,q) 
below  is  called  with  M  a  matroid  whose  elements  have  costs  and  colors, 
and  q  an  integer.  It  finds  a  base  B  with  exactly  q  red  elements  and 
smallest  cost  possible.  (It  halts  if  no  such  base  exists.) 

The  heart  of  A  is  the  recursive  procedure  S(N,B,R).  It  is  called 
with  N  a  matroid,  B  a  base  of  all  green  elements,  and  R  a  base  of  all 
red  elements.  S  finds  a  restricted  swap  sequence  for  B  and  R. 
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procedure^  A(M,q); 
begi  n 

procedure  S(N,B,R); 

1.  begin  if  |R|  =  1  then  make  (g,h)  a  swap,  where  B  =  { g } ,  R  =  {h} 


else  begin 


let  R^  be  the  set  of 
R2  =  R  -  ; 


L[I'J 


smallest  red  elements,  and 


let  B  -  + R^  be  a  smallest  base  whose  red  elements 

are  exactly  R^; 

S(N-R  /B-G, ,  G- ,  R, ) ;  comment  find  the  swaps  for  R, ; 

2  x  1  i  X 


end  end  S 


S(N-G^/Rp  B  -  Gp  R^) ;  comment  find  the  swaps  for  R^; 


let  B^  be  a  base  with  the  minimum  number  of  red  elements 
and  smallest  cost  possible; 

let  B^  be  a  base  with  the  maximum  number  of  red  elements 
containing  all  red  elements  of  B^,  containing  only 
green  elements  of  B^,  and  smallest  cost  possible; 

if  q  <  l  or  u  <  q  then  halt  comment  no  base  with  q  red 
elements  exists; 

S(H-(B^¥u)/(6,nBu),B,-Bu,Bu-B,); 
let  W  contain  the  q  -l  smallest  swaps  found  by  S; 
let  G  contain  the  green  elements  of  swaps  of  W; 
let  R  contain  the  red  elements  of  swaps  of  W; 

B:=B£-G  +  R  comment^  B  is  the  desired  base 


end  A; 
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Figure  3.2  illustrates  the  algorithm  on  Figure  2.1.  The  initial 
call  to  S  finds  the  base  of  Figure  3.2  (a),  resulting  in  recursive  calls 
on  the  graphs  of  Figure  3.2  (b).  The  algorithm  eventually  finds  the  swaps 
of  Figure  2.2. 

For  the  algorithm  to  work  correctly  we  must  break  ties  in  cost 

consistently,  as  in  Corollary  3.5.  We  shall  see  below  (Lemmas  4.1,  6.1) 

that  in  most  applications  of  the  algorithm  there  is  sufficient  time  to  sort. 

Thus  we  use  the  simple  rule  of  Corollary  3.5:  Assume  the  red  elements  have 

been  sorted  and  indexed  in  nondecreasing  order.  Then  in  line  2,  choose 

I R I 

the  yj  elements  of  R  with  smallest  index.  In  line  10,  if  there  is  a 
tie  for  the  q  -  smallest  swap:,  choose  the  one  whose  red  element  has 

smaller  index  for  W.  (An  alternate  approach  to  tie-breaking  is  given 

in  Section  4,2). 

Theorem  3.2.  Procedure  A  finds  a  smallest  base  with  exactly  q  red  elements, 

if  one  exists. 

Proof.  We  first  check  that  procedure  S  is  correct:  when  called  with  B 
a  base  of  all  green  elements  and  R  a  base  of  all  red  elements,  S  finds  a 
restricted  swap  sequence.  We  prove  this  by  induction  on  |R|.  The  case 
| R |  =  1  is  handled  correctly  by  line  1. 

For  | R |  >  1,  lines  2-5  find  the  restricted  swap  sequence 
Corollary  3.6:  The  entrance  conditions  on  B  and  R  imply  that  the  set  of 
Corollary  3.6  is  B  -  in  the  algorithm.  By  induction  the  recursive 
call  of  line  4  constructs  S^.  It  is  easy  to  see  that  the  set  G^'  of 
Corollary  3.6  is  G^  in  the  algorithm.  By  induction  the  recursive  call 
of  line  5  constructs  S,-,.  Thus  S  works  correctly. 

Now  we  show  that  A  is  correct.  Lemma  3.2  (b)  shows  bases  B^  and  Bu 
of  lines  6-7  exist.  In  line  9,  S  finds  a  restricted  swap  sequence  for 
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and  V  ^  Lemma  3.2  (a).  Line  11  finds  the  desired  base  B,  by 
Corollary  3.5  and  the  Augmentation  Theorem.  □ 

Now  we  examine  the  efficiency  of  algorithm  A.  We  do  not  derive 
a  general  time  bound,  since  more  accurate  bounds  can  be  given  for  specific 
matroids.  We  start  by  discussing  three  properties  of  the  matroid  that  are 
desirable  for  an  efficient  implementation. 

First,  the  di vide-and-conquer  approach  of  A  depends  on  the  ability 
to  contract  and  delete  efficiently.  Specifically,  these  operations  are 
needed  in  lines  4,  5,  and  9. 

Second,  A  needs  an  efficient  algorithm  for  finding  a  minimum-cost 
base.  This  algorithm  can  be  used  for  lines  3,  6  and  7.  For  instance 
line  6  is  done  as  follows:  Delete  all  red  elements  and  find  a  smallest 

base  Br  Then  contract  Bl  and  find  a  smallest  base  B2.  Finally  set 

B£  =  B1  u  B2‘  L1nes  3  and  7  are  done  in  a  way-  (Another  way  to 

do  lines  3,  6  and  7,  without  contracting  or  deleting,  is  to  modify  the 
cost  function  so  the  desired  base  has  minimum  cost). 

The  third  desired  property  concerns  the  greedy  algorithm.  This 
algorithm,  which  finds  a  smallest  base  on  any  matroid,  works  as  follows 
[L 1 ,pp, 275-77] :  It  is  given  a  list  of  all  elements,  sorted  so  the  cost  is 

nondecreasing.  It  prunes  the  list  to  the  desired  base,  by  scanning  it 

from  beginning  to  end,  deleting  any  element  that  forms  a  circuit  with 
previous  (undeleted)  elements  of  the  list. 

Specific  matroids  often  have  algorithms  that  are  faster  than  the 
greedy  one.  These  of  course  are  the  method  of  choice  for  lines  6  and  7. 

However  the  greedy  algorithm  is  particularly  suited  for  line  3.  Even 
though  line  3  is  repeated  many  times  in  the  recursion,  the  sort  required 
by  the  greedy  algorithm  need  only  be  done  once.  The  details  are  as 
follows. 
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Procedure  S  is  called  with  bases  B  and  R  given  as  sorted  lists. 

Line  3  finds  the  desired  base  by  running  the  greedy  algorithm  on  the  list 
of  elements  of  R„  followed  by  the  list  of  elements  of  B.  The  sorted  lists 

'  i 

for  the  recursive  calls  of  lines  4  and  5  are  easily  constructed.  Thus  in 
procedure  S,  no  sorting  is  done  for  the  greedy  algorithm.  Instead,  linear¬ 
time  list  manipulation  is  done  inside  S,  and  one  sort  is  done  before  the 
first  call  to  S. 

This  brings  us  to  the  third  desirable  property  of  the  matroid:  The 
greedy  algorithm  runs  faster  than  other  minimum-cost  base  algorithms  if  the 
elements  are  given  in  sorted  order. 

We  illustrate  the  efficiency  of  algorithm  A  by  deriving  a  time  bound 
for  graphic  matroids.  Here  the  problem  is  to  find  a  smallest  spanning 
forest  with  q  red  edges. 

Graphic  matroids  have  the  three  properties  that  allow  efficient 

implementation  of  A.  First,  contraction  and  deletion  are  efficient,  each 
requiring  time  0(m+n).  Here  we  assume  the  graph  is  represented  by 
adjacency  lists.  To  contract  a  set  of  edges,  form  adjacency  lists  for 
the  new  graph,  using  a  linear  connectivity  algorithm  [AHU] .  Further, 
for  each  edge  in  the  contracted  graph,  record  the  edge  in  M  (the  original 
graph  input  to  A)  that  it  derives  from.  This  is  necessary  so  that  when 
swaps  are  formed  in  line  1  of  A,  the  swaps  consist  of  edges  of  M, 

Note  that  contractions  may  introduce  parallel  edges  in  the  graph. 
However  it  is  easy  to  see  that  in  A,  there  are  at  most  two  parallel  edges, 
one  of  each  color,  between  two  given  vertices.  Actually,  we  can  allow 
the  input  graph  M  to  contain  such  parallel  pairs. 

Concerning  the  second  property  needed  to  implement  A,  there  are 
efficient  algorithms  for  finding  a  minimum  cost  spanning  forest,  in  time 
0(m  log  log(2+m/n)n)  ^1me  [^",T2,Y].  And  for  the  third  property,  the  greedy 


algorithm  (i.e.,  Kruskal's  algorithm  for  minimum  spanning  trees)  is  even 
faster,  0(ma(m,n)),  if  the  edges  are  already  sorted  [AHU,  pp. 172-6]. 


Theorem  3.3.  procedure  A  finds  a  smallest  spanning  forest  with  q  red 

edges  (if  one  exists)  in  time  0(m  log  1°g(2+m/n)n  + n a(n,n)log  n)  and 
space  0(m+n) . 


Proof.  We  first  estimate  the  time  for  procedure  S.  S  is  called  with 
N  a  graph  consisting  of  two  spanning  forests,  B,  containing  all  green 
edges,  and  R,  containing  all  red  edges;  both  B  and  R  are  given  as  lists 
of  edges,  sorted  so  cost  is  nondecreasing,  n,  the  rank  of  the  matroid, 
is  the  number  of  edges  in  each  of  B  and  R.  Line  2  uses  0(n)  time. 

Line  3,  using  the  greedy  algorithm,  is  0(na(n,n)).  The  deletions  and 
contractions  in  lines  4  and  5  are  0(n).  So  if  t(n)  is  the  time  required 


by  S  on  a  graph  of  rank  n,  there  are  constants  c^,  c^  >  0  so  that 


t(n)  <  c^na(n,n)  +  t(|J  )  +  t(  Rj-  ),  for  n  >  1; 
t(l)  =  c2. 


"allows  by  induction  that  t(n) 


,n) log  n) . 


The  space  needed  by  S  is  0(n).  For  suppose  each  recursive  call 
stores  the  lists  B  and  R.  If  S(n)  is  the  space  required  on  a  graph  of 
rank  n,  there  are  constants  c^,  c^  >  0  so  that 

S(n)  <  c^n  +  S(  j  ),  for  n  >  1 ; 

S ( 1 )  =  c2. 

The  desired  bound  on  S  follows. 

Now  suppose  A  is  called  on  a  graph  of  rank  n,  with  m  edges. 

Lines  6-7,  using  an  efficient  minimum  spanning  tree  algorithm,  are 
0(m  log  log^2+ni^n^n) .  Line  9  is  0(n  a  (n,n)  log  n) .  (This  includes  the 
time  to  sort  the  edges  in  the  spanning  forests  B^-B^  and  B  -B^.) 
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Line  10  is  0(n),  using  a  linear  selection  algorithm  [BFPRT,SPP].  The 
desired  time  bound  for  A  follows.  The  space  is  obvious.  □ 

The  next  section  improves  this  time  bound  by  eliminating  the  factor 
a(n,n)  in  the  second  term.  Subsequent  sections  apply  A  to  other  matroids. 
The  efficiency  can  be  estimated  by  computations  similar  to  Theorem  3.3. 

Sections  4-6  all  use  algorithms  that  are  variants  of  A.  One  simple 
variant  is  to  replace  procedure  S  by  a  procedure  that  computes  a  restricted 
swap  sequence  directly  from  Definition  3.3.  This  approach,  coupled  with 
data  structures  and  algorithms  that  capitalize  on  special  features  of  the 
matroid,  gives  our  best  algorithms. 
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. 4 .  Graphic  Matroids 

This  section  discusses  our  problem  on  graphic  matroids.  As  mentioned 
before,  the  problem  here  is  to  find  a  smallest  spanning  forest  with  q  red 
edges.  First  we  use  the  "dynamic  tree"  data  structure  of  Sleator  and 
Tarjan  [ST]  to  solve  the  problem  in  0(m  log  log^2+myn ^n  +  n  log  n)  time.  Then 
we  discuss  a  special  case  of  the  problem  -  finding  a  smallest  spanning  tree 
with  a  degree  constraint.  We  show  this  case  is  linear-time  equivalent  to 
finding  an  (unconstrained)  minimum  spanning  tree. 

4.1.  Spanning  Forests 

This  section  shows  that  for  graphic  matroids  a  restricted  swap 
sequence  can  be  rapidly  computed  from  the  definition.  It  also  gives  a  lower 
bound  to  show  that  no  implementation  of  the  swap  sequence  approach  can  be 
faster. 

The  dynamic  tree  data  structure  allows  a  number  of  operations, 
including  the  following: 

find  max  (v)  -  return  an  edge  of  maximum  cost  on  the  tree  path  from 
v  to  the  root*, 

evert  (v)  -  modify  the  tree  so  that  v  is  the  root; 

cut  (v,w)  -  delete  the  tree  edge  (v,w); 

1 ink  ( v ,w,x)  -  link  the  trees  containing  v  and  w  by  adding  edge 
(v,w),  setting  the  cost  of  (v,w)  to  x. 

The  time  for  a  series  of  m  such  operations  is  0(n+mlogn)  [ST]. 

It  is  easy  to  see  how  to  use  these  operations  to  compute  a  restricted 
swap  sequence.  We  maintain  the  tree  T  so  that  any  red  edge  in  T  has  a 
modified  cost  equal  to  S-l,  where  S  is  the  smallest  (original)  cost  of  any 
edge.  Then  to  compute  the  smallest  swap  for  a  red  edge  h  =  (v,w),  make  v 
the  root,  by  evert(v);  find  the  edge  g  in  the  smallest  swap  (g,h),  by 
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fi nd  max(w) ;  and  execute  the  swap,  by  cut(g) ,  1 ink(v,w,  $-1) .  It  is  easy  to 
see  that  the  n  swaps  of  a  restricted  swap  sequence  are  found  in  time 
0(n  log  n) . 

Now  we  sketch  a  complete  algorithm  for  our  intersection  problem  on 
graphic  matroids,  based  on  algorithm  A.  First  find  a  smallest  spanning 
forest  with  the  minimum  number  of  red  edges  (line  6  of  A).  To  do  this, 
use  an  efficient  minimum  spanning  tree  algorithm,  with  one  modification: 

When  the  costs  of  edges  of  different  colors  are  compared,  always  declare  the 
red  edge  to  be  larger.  The  time  is  0(m  log  log (2+m/n)n^ •  [T2].  Similarly 
compute  base  (line  7).  Then  form  a  new  graph  by  contracting  B^  n  B^  and 
deleting  all  edges  but  B^  u  Bu>  Find  a  restricted  swap  sequence  in  this 
graph,  proceedi ng  as  described  above.  Finally  as  in  lines  10-11,  find  the 
desired  base  B. 

Theorem  4.1.  A  smallest  spanning  forest  with  q  red  edges  can  be  found,  if  it 

exists,  in  time  0(m  log  log^r;+m/n)n  +  n  log  n)  and  space  0(m+n).  □ 

Now  we  give  lower  bounds  that  indicate  how  close  to  optimal  our 

algorithm  is.  For  these  we  define  two  timing  functions:  tree(m,n) 
is  the  time  required  by  an  optimal  algorithm  to  find  a  minimum  spanning 
tree  on  a  graph  of  m  edges  and  rank  n;  sort(n)  is  the  time  required  to 
sort  n  numbers.  Note  tree  and  sort  are  both  defined  for  algorithms  on 
Random  Access  Machines.  The  comparison  tree  model  offers  strong  evidence 
that  sort  is  0  (n  log  n)  [F  ],  and  also  some  evidence  that  tree  is 
e(m  1  og  lo9(2+m/n)n)  ^CT’  T23- 

It  is  clear  that  any  algorithm  for  a  smallest  spanning  tree  with 
q  red  edges  requires  time  ft(tree(m,n) )  (let  all  edges  be  green  and  take 
q=0).  Also  from  the  above  discussion,  our  algorithm  uses  time 
0(tree(m,n)  +  n  log  n) .  So  the  algorithm  is  at  most  0(n  log  n) 
above  optimal . 
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We  can  show  that  among  algorithms  using  the  swap  sequence  approach, 
ours  is  optimal  to  within  a  constant  factor.  To  be  precise,  say  that  an  algorithm 

"uses  the  swap  sequence  approach"  if,  given  a  graph  consisting  of  two  spanning 
trees,  one  green  and  one  red,  it  finds  the  swaps  (e.. ,f.)  that  constitute 
the  swap  sequence.  Note  the  algorithm  need  not  determine  the  correct 
order  of  the  swaps.  For  example,  procedure  A  uses  the  swap  sequence 
approach. 

Lemma  4.1.  Any  algorithm  using  the  swap  sequence  approach  requires 
time  fi(sort( n) ) . 

Proof.  We  give  a  procedure  that  sorts  n  arbitrary  numbers.  It  works  by 

constructing  a  rank  n+1  graph,  calling  the  algorithm,  and  then  processing 

the  swaps  given  by  the  algorithm  to  find  the  sorted  order.  Excluding  the 

time  for  the  algorithm,  the  procedure  uses  0(n)  time.  Since 

cln  ~  sort(n)  <  sort(n-l)  +  c2n  this  suffices  to  prove  the  Lemma. 

Consider  n  numbers  x^...^.  Without  loss  of  generality  assume 

all  numbers  are  positive  (otherwise  increase  the  numbers  by  a  constant). 

Let  M  be  the  largest  of  these  numbers  plus  one.  The  graph  for  x  x  is 

shown  in  Figure  4.1.  Formally,  the  graph  has  vertices  v,  w,  and  u. 

i 5 

i  =  1,  ...,  n;  green  edges  (v,ui)  costing  -x..  and  red  edges  (w,u.) 

costing  x.,  l  =  1,  n;  green  edge  (v,w)  costing  0  and  red  edge 
(v,w)  costing  M. 

To  specify  the  swap  sequence  of  this  graph,  let  the  given  numbers 
in  nondecreasing  order  be  y1,...,yn-  Then  identifying  each  edge  by 
its  cost,  the  swap  sequence  is  (O.yp,  ,y2) , . . . , (-y^.yj , 

(-yn.M). 
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The  procedure  constructs  the  graph  and  calls  the  algorithm  to 

find  the  swaps  of  the  swap  sequence.  Then  it  encodes  the  swaps  into 
an  array  S,  by  setting  S(i)  =  j  if  ((v>u.),  (w,u.))  is  a  swap,  and 

S  1  if  /fw  |,|)  \  \  J  .  r-  • 

7  °  "  vw,vy/5  VVVj  a  bWdp.  nnai  ly  it  outputs  the  sequence 

S(0),  S  (0) ,  . . . ,  S  (0) .  It  is  easy  to  see  this  gives  the  indices  of 
the  numbers  in  sorted  order.  Furthermore,  the  time  spent  before  and 
after  the  algorithm  is  0(n),  as  desired.  □ 


Cprallajxi^  Any  algorithm  that  finds  a  smallest  spanning  tree  with  q 
red  edges  using  the  swap  sequence  approach  requires  time 
ft(tree(m,n)  +  sort(n)).  □ 
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4.2,  Spanning  Trees  with  a  Degree  Constraint. 

Now  we  turn  to  a  special  case  of  our  problem  on  graphic  matroids 
that  has  some  practical  significance.  An  important  question  in  network 
design  is  how  to  link  a  central  computer,  having  a  limited  number  of 
communication  channels,  to  a  collection  of  peripheral  computing  sites. 
Some  versions  of  this  problem,  such  as  the  Capacitated  Tree  Problem, 
are  NP-complete  [P].  Closely  related  but  tractable  is  the  problem  of 
finding  a  smallest  spanning  tree  such  that  a  given  vertex  v  has  a 
specified  degree.  Polynomial  algorithms  have  been  presented  for  this 
problem  [GK] ;  the  most  efficient  uses  time  0(m  log  l°C(2+m/n)n  + n  n) 
[Gal],  We  show  here  that  the  problem  is  equivalent  to  finding  a  minimum 
spanning  tree.  More  precisely,  we  give  an  algorithm  for  the  problem 
that  finds  one  minimum  spanning  tree  and  then  does  0(n)  postprocessing. 

An  exact  statement  of  the  degree-constrained  spanning  tree 
problem  is  as  follows.:  Given  a  connected  graph  with  real -valued 
edge  costs,  find  a  spanning  tree  with  smallest  possible  cost  such  that 
the  degree  of  a  given  vertex  v  is  exactly  p.  We  solve  the  following 
problem:  Given  a  connected  graph  with  edge  costs  and  colors,  such  »  , 

that  all  green  edges  are  incident  to  v,  find  a  smallest  spanning  tree 
with  q  red  edges.  This  problem  includes  the  degree-constrained 
spanning  tree  problem  ,  if  we  take  q  =  n-p,  but  it  allows  red  edges 
incident  to  v.  (Note  that  in  both  problems  the  restriction  to  connected 
graphs  is  for  convenience  only.) 
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The  problem  is  simplified  by  making  the  desired  tree  unique. 

This  can  be  done  in  a  variety  of  ways.  Here  we  take  the  desired  tree 

to  be  lexicographically  minimum.  That  is,  assume  the  edges  of  the 

graph  are  indexed  from  1  to  m.  For  any  set  of  edges,  form  a  vector  by 

arranging  its  edge  indices  in  increasing  order,  i.e.,  for  {e.  ,e.  ,...,e.  } 

'l  1 2  1  k 

where  i^  <  i^  <  ...  c  i^,  form  (i^ ,  i^,  . . . ,  i  ) .  Now  among  all  smallest 
spanning  trees  with  q  red  edges,  our  algorithm  finds  the  one  whose  vector 
is  lexicographically  minimum. 

Note  that  this  tree  is  the  unique  smallest  spanning  tree  with 


q  red  edges  with  respect  to  a  certain  cost  function  c‘.  To  define  c ' , 
set  e  =  2m  (mi n{ |c(S)-c(T) I  .  S  and  T  are  sets  of  edges  with 
c(S)  *  c(T) }  u  {1}),  and  set  c'(e^)  =  cfe^)  -  e1 .  It  is  easy  to  check 
that  for  any  two  sets  S,  T,  c'(S)  <  c ' ( T)  if  and  only  if  c(S)  <  c(T)  or 
c(S)  =  c(T)  and  S  is  lexicographically  smaller  than  T.  This  implies  the 


desired  property  of  c  . 

Note  also  that  no  two  swaps  have  the  same  costc':  It  is  easy  to 

check  that  c'(e..se.)  <  c'fe^e^)  if  either  c(e..,e.)  <  c[e^9e^)  or 

c(e. ,e.)  =  c(e, ,e»)  and  the  vector  for  {e.,e„}  is  smaller  than  the 
i  j  k  Z  i  Z 

vector  for  (e  .,e,  }. 

J  K 

In  the  lemmas  that  follow,  we  assume  the  cost  function  c  has 
been  changed  to  c'.  Hence  no  two  edges  or  swaps  have  the  same  cost. 

In  the  algorithm,  we  can  use  c  without  explicitly  calculating  it,  by 
using  the  characterization  of  c'  in  terms  of  c  specified  above. 

We  make  two  more  assumptions.  First,  the  green  edges  form  a 
spanning  tree.  Second,  some  of  the  red  edges  form  a  spanning  tree, and 
the  red  edges  not  in  the  tree  are  incident  to  v. 
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(Note  that  by  preprocessing  as  in  procedure  A,  the  general  case  reduces 
to  the  case  where  the  green  and  red  edges  are  spanning  trees.  We  shall 
see  that  our  algorithm  may  introduce  extra  red  edges  incident  to  v.) 

Now  let  T. ,  i  =0,  n  be  the  smallest  spanning  tree  with 

exactly  i  red  edges.  Let  T.  =  T._^-e. +f. ,  i  =  1,  . ..,  n,  so  that 
(e..,f. ),  i  =  1,  . ..,  n,  is  the  swap  sequence.  As  noted  above,  T. ,  e.. 
and  f .  are  unique. 

Our  approach  is  to  start  with  Tq  and  repeatedly  find  sets  of 
edges  that  are  in  the  first  q  swaps.  We  do  not  find  the  swaps  themselves. 
The  following  simple  concept  is  central  in  deducing  the  edges  in  swaps. 

Definition  4.1.  For  a  green  edge  e  =  (v,w),  let  y(e)  be  the  smallest 
red  edge  incident  to  w.  Equivalently,  (e,y(e))  is  the  smallest  swap  for 
e  and  Tq . 

Note  y(e)  always  exists,  since  the  red  edges  span.  Let 

M  =  { (e ,u(e) )  |  e  is  green}.  The  M-swaps  for  a  graph  are  shown  in 

Figure  4.2.  The  next  four  lemmas  show  how  the  M-swaps  allow  us  to  deduce 

edges  in  T  „ 

q 

Lemma  4.2.  Suppose  (e,p(e))  is  not  a  valid  swap  for  some  optimal  tree 
T. ,  0  <  i  <  n.  Then  p(e)  e  T. . 

Proof.  Choose  i  as  small  as  possible  such  that  (e,y(e))  is  not  valid. 

Clearly  i  >  0.  It  suffices  to  prove  the  lemma  for  T. ,  since  this  implies 

y(e)  e  T .  for  j  >  i . 

3 

First  note  that  either  e  £  T.  or  y(e)  e  T^. .  For  otherwise 
e  e  T.  and  y(e)  £  T^ .  Since  (e,y(e))  is  not  valid,  the  cycle  C(y(e),T.) 
does  not  contain  e.  This  implies  C(y(e),T..)  does  not  pass  through  v  .  But 
then  C(y(e),T.)  consists  entirely  of  red  edges,  contradicting  the  fact  that 
the  red  edges  not  incident  to  v  are  acyclic. 
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Thus  wi thou t  1  oss  generality  e  ^  T\  .  Hence  T.  derives  from 

T.  1  by  doing  the  smallest  swap  for  e.  (e,u(e))  is  valid  for  T. 

1-1  '  ......  1-1 

by  definition.  Thus  Corollary  3.2  implies  it  is  the  smallest  swap  for  e, 
and  y(e)  e  T. .  □ 

Lemma  4.3.  If  (e,y(e))  is  among  the  q  smallest  swaps  of  M,  then 

y(e)  e  T  . 
q 

Proof.  Among  the  first  q  swaps  (e..,^.),  i  =  1 ,  . . . ,  q  of  the  swap 
sequence,  there  is  one  with  c(e- (e- ) )  >  c(e,y(e)).  Corollary  3.2 
implies  cfe^f.)  >  c(e,y(e)). 

Now  if  (e,y(e) )  is  a  valid  swap  for  T^. _1 ,  it  is  the  smallest  swap, 

and  thus  (e,y(e) )  =  (e.  ,f^. ) .  (Recal  1  no  two  swaps  gave  the  same  cost . )  Hence 
y(e)  £  Tq .  Otherwise,  if  (e,y(e) )  is  not  valid.  Lemma  4.2  implies 
y(e)  £  T  .  □ 


In  Figure  4.2,  the  lemma  implies  edges  6  and  8  are  in  T3,  as 
shown  in  Figure  4.3.  Notice  the  lemma  does  not  give  q  =  3  distinct  red 
edges  in  T  .  In  general,  the  lemma  need  only  give  j  red  edges,  since 
a  red  edge  can  be  the  y  -  value  of  two  green  edges. 


Lemma  4.4.  If  (e,y(e))  is  not  among  the  2 q -1  smallest  swaps  of  M, 

then  e  £  T  . 

q 


Proof.  We  argue  by  contradiction.  Suppose  e  i  T  .  Then  some  swap 

q 

i  =  q,  in  the  swap  sequence  has  e.  =  e.  From  Corollary 

3.2,  c(e,y(e))  <  c(e. ,f - )-  Clearly  none  of  the  2 q- 1  swaps  of  M  that 

are  strictly  smaller  than  (e,y(e) )  is  valid  for  T._r  These  swaps  contain 
at  least  q  distinct  red  edges,  and  Lemma  4.2  shows  they  are  all  in 
Tm.  Since  i  -  1  <  q ,  this  is  the  desired  contradiction.  □ 
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This  Lemma  is  illustrated  by  edge  2  in  Figures  4.2-3. 

Lemmas  4.3-4  allow  us  to  deduce  edges  in  T  .  Now  we  show  how 

q 

these  lemmas  can  be  itpr^tpH  i  pt  f  hp  anv  spf  of  edaes  in  T  Form  a 

v  q  * 

multigraph  H  as  follows.  First  form  G/F.  In  a  slight  abuse  of  notation, 
let  v  denote  the  vertex  of  G/F  that  contains  the  original  vertex  v  of  G. 

If  G/F  contains  parallel  edges,  they  are  incident  to  v  (since  two 
parallel  edges  not  incident  to  v  give  a  cycle  of  red  edges  in  G-v, 
a  contradiction.)  Now  for  every  set  of  parallel  edges  (all  incident  to  v) 
of  the  same  color,  delete  all  but  the  edge  of  smallest  cost.  The  resulting 
multigraph  is  H.  (This  is  illustrated  in  Figure  4.4  (a).)  We  show 
that  T  corresponds  to  an  optimal  tree  in  H.  Let  p  be  the  number  of  red 

H 

edges  in  F. 

Lemma  4.5.  In  the  multigraph  H,  the  edges  T  -F  form  the  smallest  spanning 

H 

tree  with  q-p  red  edges. 

Proof.  First  note  that  all  edges  of  T^-F  are  in  H,  i.e.,  none  are  deleted* 
For  if  an  edge  e  e  T^  is  parallel  to  an  edge  f,  clearly  f  /  T^ ,  since  T^ 
is  acyclic.  If  f  has  the  same  color  as  e  but  smaller  cost,  then  the 
spanninq  tree  T  -e+f  contradicts  the  definition  of  T  .  So  e  is  not  deleted. 

q  q 

Now  it  is  clear  that  T^-F  is  a  spanning  tree  of  H  containing 
q-p  red  edges.  Further,  it  is  the  smallest  such  tree.  For  if  S  is  a 
smaller  tree,  then  S+F  is  a  spanning  tree  of  G  that  contradicts  the 
definition  of  T  .  □ 

Figure  4.4  shows  H  and  T^-F,  the  smallest  spanning  tree  with  one 
red  edge.  Note  that  in  H,  the  smallest  spanning  tree  with  no  red  edges 
is  {1,3,9},  This  tree  corresponds  to  a  spanning  tree  of  G  with  two  red 
edges,  {1 ,2, 3, 6,8,9} ,  but  this  is  not  T^  =  (1 ,2, 3, 6,7 ,9} . 
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of  T 

q 


The  last  result  allows  us  to  iterate  Lemmas  4.3-4  until  all  edges 
are  deduced.  This  is  the  method  followed  by  procedure  B  given  below. 


On  entry  to  B,  the  graph  (a  multigraph)  consists  of  two  spanning 
trees,  one  green  and  the  other  red.  All  green  edges  are  incident  to  v;  red 
edges  may  or  may  not  be  incident  to  v.  A  spanning  tree  containing  q  red 
edges  is  to  be  found.  (By  assumption,  such  a  tree  exists.) 


B  adds  edges  to  a  list  F  until  it  is  the  desired  tree,  q  is 
maintained  as  the  number  of  red  edges  to  be  added. 


1. 

2. 

3. 


procedure^  B ; 
begin 


.  . . .  ••••■) 

while  the  graph  has  more  than  one  vertex  do 


beg  i  n 


for  each  green  edge  e  =  (v,w)  do 

y(e):=  the  smallest  red  edge  incident  to  w; 

let  M  =  {(e,y(e))je  is  a  green  edge}; 

let  M1  contain  the  q  smallest  swaps  of  M; 

let  M0  contain  all  but  the  2q~l  smallest  swaps  of  M; 

L 


4. 

5. 

6. 
7. 


end 


add  the  red  edges  of  swaps  of  to  F;  decrease  q  by  the  number 
of  edges  added; 

add  the  green  edges  of  swaps  of  to  F; 

contract  the  edges  of  F; 

for  each  vertex  of  the  graph  w  *  v  do 

/y* amv 

delete  all  edges  from  w  to  v  except  the  smallest  green  and 
smallest  red  edge  (if  it  exists); 

end  B : 
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When  B  is  called  with  q  -  3  in  Figure  4.2,  the  first  iteration 


forms  the  graph  of  Figure  4.4  (a);  the  second  iteration  adds  the  edges 
of  Figure  4.4  (b)  to  F,  completing  the  tree 


Note  that  in  lines  2,  3,  and  7,  ties  in  cost  c  are  broken 
lexicographically,  consistent  with  the  cost  function  c'  described  at 
the  start  of  this  section. 


Also  in  line  6,  it  is  only  necessary  to  contract  the  edges  added 
to  F  in  the  current  iteration.  As  usual,  we  do  the  contraction  by 
forming  adjacency  lists  for  the  new  multi  graph.  Further,  for  each  edge 

in  the  current  graph  we  record  the  edge  in  the  original  graph 

from  which  it  derives.  This  way  we  can  maintain  F  as  a  list  of  edges 

in  the  original  graph. 


Lemma  4.6.  Suppose  B  is  called  with  the  entry  conditions  (given  above 
B)  satisfied.  Then  B  adds  edges  to  F  that  form  the  smallest  spanning 
tree  with  q  red  edges.  B  runs  in  0(n)  time  and  space. 

Proof.  We  first  prove  correctness.  We  show  that  every  time  line  1  is 
reached,  the  following  conditions  hold:  "  r 

(i)  The  desired  tree  consists  of  F  plus  the  smallest  spanning 
tree  with  q  red  edges  in  the  current  graph. 

(ii)  The  green  edges  form  a  spanning  tree,  with  each  green  edge 
incident  to  v. 

(iii)  The  red  edges  consist  of  a  spanning  tree  and  zero  or  more 
edges  incident  to  v. 


The  entry  conditions  show  (i)-(iii)  hold  initially.  So  suppose 
lines  2-7  are  executed.  The  edges  added  to  F  in  lines  4-5  are  in  the 
desired  tree,  by  Lemmas  4.3-4.  (Note  if  q  =  0,  line  5  adds  all 
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green  edges  to  F,  as  desired).  Lines  6-7  form  the  graph  H  of  Lemma  4.5. 
Hence  after  line  7,  (i)  holds,  (ii)  is  obvious.  To  see  (iii),  first 
note  that  the  red  edges  not  incident  to  v  in  the  new  graph  are  acyclic 
(otherwise  the  original  graph  has  a  red  cycle  missing  v).  Further, 
the  red  edges  span  the  new  graph,  (iii)  follows.  This  completes  the 
induction. 

Finally  note  that  every  time  through  the  loop*  if  q  >  0  the 
iteration  decreases  q;  if  q  =  0,  the  loop  halts.  Thus  B  eventually  halts. 
Now  (i)  shows  F  is  as  desired. 

Now  we  estimate  the  efficiency  of  B.  First  observe  that  one 
iteration  of  lines  2-7  takes  time  linear  in  the  number  of  edges  (in 
the  current  graph).  Line  3  uses  a  linear-time  selection  algorithm 
[BFPRT,SPP] . 

Define  the  following  quantities,  for  i  >  1:  q.  is  the  value  of  q 
immediately  before  the  ith  iteration  of  the  whfle^loop;  g.  (r.)  is  the 
number  of  green  (red)  edges  in  the  graph  immediately  before  the  ith 
iteration.  The  following  relations  hold,  for  i  >  1: 

(1)  qi+1  *  q • / 2 , 

(2)  9i+1  *  2q.-l, 

(3)  r.  <  2gi-l. 

(1)  holds  because  the  i^h  iteration  adds  the  red  edges  of  to  F.  (2) 
holds  because  the  i^  iteration  adds  the  green  edges  of  M^.  (3)  follows 

from  inductive  assertions  (ii)  -  (iii). 

Now  the  i th  iteration  uses  time  0(q.+r.).  For  i  =  1  this  is 
0(n),  by  (ii)  -  (iii).  Otherwise  (2)  -  (3)  show  that  for  i  >  l,r  and 

gi+1  are  both  0(qi).  (i)  shows  q.  <  q1/21"1  <  n/21'1.  Hence  all  iterations 
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after  the  first  take  time  at  most  a  constant  times  l  — =  0(n). 

i=l 

This  gives  the  desired  time  hound. 

The  space  bound  is  obvious.  (Note  that  we  only  maintain 
adjacency  lists  for  the  current  graph.)  □ 

The  main  routine  for  finding  a  degree-constrained  spanning  tree 
is  similar  to  procedure  A:  First  find  trees  T^  and  T^  (lines  6-7  of  A). 
Then  check  q  for  feasibility  (line  8).  If  q  is  feasible,  decrease  its 
value  by  £,  and  place  the  l  red  edges  of  T^  in  F.  Then  delete  all  edges 

besides  LuT  ,  and  contract  T„  n  T  .  After  this  step  the  green  edges 

aL  u  'C  u 

and  the  red  edges  both  are  spanning  trees.  Finally  call  B. 

Theorem  4.2.  The  above  algorithm  finds  a  smallest  spanning  tree  with 
a  degree  constraint,  in  time  0(m  log  log  (2+m/n)n^  and  space 

Proof.  T^  and  Ty  are  found  by  using  a  minimum  spanning  tree  algorithm  on 
the  graph  with  edge  costs  appropriately  modified.  The  time  for  this 
is  0(mloglog(2+m/n)n)  [CT,  Y,T2].  The  new  graph  can  be  constructed 
from  the  0(n)  edges  in  T£  u  Ty  in  time  0(n).  B  is  0(n).  The  time 

bound  follows.  □ 

Actually  the  algorithm  can  be  implemented  with  only  one  call 
to  a  general  minimum  spanning  tree  algorithm.  First  find  T^,  the 
spanning  tree  with  the  greatest  number  of  red  edges  and  smallest  cost 
possible.  Use  the  general  algorithm  for  this.  Then  find  T^,  the 
spanning  tree  with  the  fewest  number  of  red  edges,  containing  all 
green  edges  of  T  ,  only  red  edges  of  T  ,  and  smallest  cost  possible. 

Note  that  after  the  cost  function  has  been  appropriately  modified,  this 
can  be  done  by  finding  a  minimum  spanning  tree  on  a  graph  consisting 
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of  a  spanning  tree  Tu  and  edges  incident  to  one  vertex  v.  This 
requires  time  0(n),  by  an  algorithm  originally  due  to  Spira  and  Pan: 


R  nr i  -f-h  1  i  nae 

‘-'9  "  1  I  i  


2-5  replaced  by  one  step  that  adds  the 


smallest  edge  incident  to  each  vertex  in  the  graph  to  F.  (Also,  ignore 
colors  in  line  7.)  Details  are  in  [SP,  p.  377].,  This  gives  the 
following  result: 


Corollary  4.2.  A  smallest  spanning  tree  with  a  degree  constraint  can 
be  constructed  by  modifying  costs  (in  time  0(m-)),  finding  one  minimum 
spanning  tree,  and  doing  0(n)  post-processing.  □ 


Note  that  finding  a  smallest  spanning  tree  with  a  degree  constraint 
requires  at  least  the  time  to  find  a  minimum  spanning  tree.  (We  can  find 
a  minimum  spanning  tree  for  a  graph  by  adding  a  vertex  v  with  one  edge, 
and  finding  a  smallest  spanning  tree  with  one  edge  incident  to  v.) 

Hence  the  two  problems  are  linear-time  equivalent. 
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5.  Matching  Matroids. 

This  section  discusses  our  intersection  problem  on  several  types 


of  matching  matroids.  First  the  matroids  are  defined.  Then  an 

0(m  +  n  log  n)  algorithm  is  given  for  simple  scheduling  matroids.  An 

9 

0(m  loa  n  +  n  )  algorithm  is  given  for  general  scheduling  matroids.  Finally. 

the  time  for  general  transversal  and  matching  matroids  is  shown  to  be 

(m  log  m  +  n  e) . 

We  begin  with  the  definitions.  A  matching  matroid  is  derived 

from  a  graph  G  and  a  subset  of  the  vertices  J.  The  elements  of  the 

matroid  are  the  vertices  of  J.  The  independent  sets  are  subsets  of  J 

that  can  be  covered  by  a  matching  of  G.  A  transversal  matroid  is  a 

matching  matroid  where  G  is  bipartite  and  J  is  one  of  the  two  vertex 

sets  of  G  (i.e.9  each  edge  goes  from  J  to  J). 

Two  special  types  of  transversal  matroids  are  of  particular 

interest.  A  (general)  scheduling  matroid  derives  froip  a  convex  bipartite 

graph.  More  precisely,  the  vertices  of  J  can  be  indexed  from  1  to  |J|  so 

that  each  vertex  of  J  is  adjacent  to  consecutive  vertices  a,  a+1 ,  b 

of  T  (  and  no  others).  We  think  of  the  vertices  of  J  as  jobs  and  the 

i th  vertex  of  T  as  the  time  period  from  i-1  to  i .  Thus  a  scheduling 

matroid  corresponds  to  the  following  situation:  A  processor  runs  for 

|7|  units  of  time.  There  are  m  =  |J|  jobs,  each  requiring  one  unit  of 

processing  time.  Each  job  j  has  a  release  time  r.  and  a  deadline  d., 

J  J 

both  integers,  0<r.<d.<|Jj.  If  job  j  is  chosen  for  execution, 

3  3 

it  must  be  started  no  earlier  than  time  r.  and  finished  no  later  than 

3 

d..  In  this  matroid  a  base  is  a  set  of  jobs  of  maximum  cardinality 
3 

that  can  be  executed  on  the  processor  so  that  each  job  meets  its 
constraints.  A  schedule  is  a  base,  together  with  a  specification  of 
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when  each  job  gets  executed. 

A  simple  scheduling  matroid  [Ll  pp. 265-6,  278]  corresponds  to  a 


convex  bipartite  graph  whsi 


arl  i  a  + 

uuju  vv_  I  I  u 


l,2,...,b,  for  some  b.  In  the  scheduling  interpretation,  each  release 

time  is  0,  so  release  times  can  be  ignored. 

In  the  following  discussion  we  use  interval  notation  in  two 

ways.  The  first  is  for  intervals  of  time.  Suppose  a  processor  executes 

n  unit-length  jobs,  one  after  another.  We  say  that  the  j0b  js 

"fc  h 

executed  in  the  time  interval  [ i - 1 , i ) .  (Thus  the  i  job  is  no  longer 
executing  at  time  instant  i).  The  second  use  employs  interval  notation 
for  sets  of  integers.  Thus  if  r  and  d  are  integers,  [r,d)  denotes  the 
integers  in  the  set  of  real  numbers  usually  denoted  [r,d),  i.e.,  [r ,d)  = 
{r,...,d-l}.  The  context  will  always  make  our  use  of  interval  notation 


unambiguous . 


5.1.  Simple  Scheduling  Matroids, 

This  section  discusses  the  following  scheduling  problem.  Given 
is  a  set  of  m  unit-length  jobs.  Each  job  has  an  integer  deadline,  a 
real-valued  profit  and  a  job  class  G  or  R.  The  profit  for  a 
job  is  earned  if  and  only  if  it  is  completely  executed  by  its  deadline. 

Find  a  maximum  profit  schedule  containing  exactly  q  jobs  in  class  R. 

This  problem  is  essentially  equivalent  to  our  intersection  problem 
on  simple  scheduling  matroids.  Note  that  we  will  give  an  algorithm  that 
finds  a  schedule,  not  just  an  (unordered)  base.  Also, as  usual,  wi thout 
loss  of  generality  we  find  a  minimum  (not  maximum)  cost  base  (schedule). 
An  example  problem  is  shown  in  Figure  5.1.  Jobs  are  identified  by  their 
cost  and  listed  underneath  their  deadline. 

We  represent  the  matroid  by  the  following  data  structure.  For 
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each  integer  time  t  between  1  and  the  largest  deadline,  there  is  a  list 
D  =  { j | j o b  j  has  d.  =  t}.  The  list  heads  are  in  an  array  so  that  given 
t,  the  first  job  in  D+  can  be  found  in  0(1)  time.  It  is  easy  to  construct 
this  data  structure  in  linear  time  from  any  reasonable  specification  of 
the  matroid. 

We  start  with  a  simple  normalization:  We  can  always  assume  that 
the  processor  runs  from  time  0  to  n,  and  exactly  n  jobs  are  executed. 

Lemma  5.1.  Let  M  be  a  simple  scheduling  matroid.  Then  there  is  a  simple 
scheduling  matroid  m',  where  M'  has  rank  n  =  max{d.|j  is  a  job  of  M  }, 
m‘  <  (n')^,  and  for  some  set  of  jobs  I  of  M,  I  plus  a  base  of  M  gives 
a  base  of  M  (i.e.,  M*  =M/I).  Further  if  the  jobs  have  costs,  then  I 
plus  a  minimum  cost  base  of  m'  gives  a  minimum  cost  base  of  M.  M1 
can  be  found  in  time  0(m). 

Proof.  First  we  prune  M  to  a  matroid  M1  with  all  the  desired  properties 

except  the  upper  bound  on  m.  To  do  this,  initialize  M'  to  M  and  I  to  0. 

Then  set  n'  =  maxld.fj  is  a  job  of  m'}.  Now  let  Jt  =  { j | j  is  a  job  of 

M*  and  d.  >  t}.  Suppose  there  is  a  time  t  such  that  |J. |  =  n  -  t.  Then 
J  L 

choose  t  as  large  as  possible;  remove  the  jobs  of  from  M  and  add  them 
to  I;  then  repeat  the  process  (start  by  redefining  n  ).  Otherwise  if 
there  is  no  time  t,  halt. 

The  correctness  of  this  procedure  follows  from  the  observation  that 
the  deleted  sets  Jt  are  included  in  any  base  of  M;  also,  the  final 
matroid  M1  has  rank  n'.  It  is  easy  to  implement  this  procedure  in  linear 
time.  (Note  that  if  we  seek  a  schedule  rather  than  just  a  base,  the 
assignment  of  I  to  time  periods  can  also  be  recorded  in  linear  time). 

In  the  second  modification  to  M 1 ,  for  each  d,  1  <  d  <  n',  remove 


all  but  d  smallest  jobs  that  have  deadline  d.  (If  fewer  than  d  jobs  have 

deadline  d,  do  not  remove  any.  If  jobs  do  not  have  costs,  the  choice 

of  jobs  to  remove  is  arbitrary).  It  is  easy  to  see  that  this  does  not 

chanqe  the  minimum  cost  of  a  base,  and  achieves  the  bound  m'  <  n ' ( n 1 +1 )  . 

2 

Further  this  can  be  done  in  the  required  time  by  using  a  linear- time 
selection  algorithm  [BFPRT,SPP],  □ 

For  the  rest  of  the  discussion  we  assume  the  matroid  has  been 
preprocessed  so  M  has  the  properties  given  above  for  M1. 

Now  we  discuss  finding  a  minimum  cost  schedule.  The  greedy 
algorithm  can  be  implemented  imtime  0(mlogm  +  n),  or  if  the  jobs  are 
already  sorted,  0(m a  (m,n)  + n) .  This  is  a  simple  exercise  using  the 
UNION-FIND  data  structure  [HS,  pp. 161-8]. 

The  time  bound  for  unsorted  jobs  is  improved  to  0(m+nlogn)  by 
the  following  algorithm.  Recall  Dt  =  {j|d.  =  t}.  The  algorithm  also 
uses  A,  a  priority  queue  of  jobs. 

procedure  C; 
begin 

1.  A:  =  0; 

2.  for  t:  =  n  to  1  by  -1  do 

/VVW  ,/w4-  <vw 

begin , 

3.  A:  =  A  u  Dt; 

4.  remove  the  smallest  job  from  A  (if  one  exists),  and  schedule 

it  in  time  interval  [t-l,t); 

end  end  C; 

Am 
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Lemma  5.2.  Procedure  C  finds  a  minimum  cost  schedule  for  a  simple 
scheduling  matroid  in  time  0(m  +  n  log  n)  and  space  0(m). 


. i-  ji j 


Proof.  We  start  by  showing  that  if  j  is  a  smallest  job  witn  deadline 
n,  there  is  a  minimum  schedule  with  j  executed  in  [n-l,n).  Suppose  a 
minimum  cost  schedule  executes  jobs  in  that  order.  If 

j  i  (k- , . . . ,k  },  then  the  schedule  kis...,k  1  j  has  the  desired  property 
(since  c(j)  <  c(k  )).  Otherwise  if  j  =  k. ,  the  schedule 
k1,...,ki_1Jki+1,...,kn_1,j  has  the  desired  property. 

Now  it  is  easy  to  prove  by  induction  on  n  that  C  finds  a  minimum 
cost  schedule.  To  verify  the  time  bound  for  C,  implement  A  and  , 

t  =  l,...,n,  as  mergeable  heaps,  e.g.,  2-3  trees  [ AHU,pp. 152-55] .  Then 
the  operations  of  union  and  removing  the  smallest  are  O(logm).  The 
time  bound  follows.  □ 


It  now  follows  that  procedure  A  runs  in  time  0(m  +  na  (n,n)  log  n) 

on  simple  scheduling  matroids.  The  analysis  is  similar  to  that  for 

spanning  trees.  Note  that  contraction  is  eas-y  to  do  on  a  simple 

scheduling  matroid:  Suppose  a  set  containing  k.  jobs  of  deadline  i, 

i  =  l,,..sn,  is  to  be  contracted.  Then  a  job  whose  original  deadline 

d 

is  d  gets  a  new  deadline  d  -  £  k. . 

i=l 

Now  we  give  an  algorithm  for  the  intersection  problem  on  simple 
scheduling  matroids  that  runs  in  0(m  +  nlogn)  time.  It  works  by 
computing  a  restricted  swap  sequence  from  the  definition.  We  start 
by  characterizing,  the  valid  swaps  for  a  base  B-  For  time  t  =  1,  ...,,n9 
let  b^  be  the  number  of  jobs  in  B  with  deadline  at  most  t.  The  si ack  in  B 
at  time  t  is  t-b^;  B  is  tight  at  time  t  if  t  =  b^.  (Thus  B  is  a  base  if 
and  only  if  it  has  non-negative  slack  at  times  l,...,n-l  and  is  tight 
at  n. ) 
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Lemma  5.3.  Let  B  be  a  base,  with  jobs  g  e  B,  h  i  B.  Choose  t  as  small 
as  possible  so  that  t  ^  and  B  is  tight  at  t.  Then  B-g+h  is  a  base  if  and 

A  in  '1  w  “i  -P  /  "f" 

uii  i y  «  I  ^ 

Proof.  Let  b'  =  B-g+h.  B1  is  a  base  if  and  only  if  for  s  =  1,  ...,n, 

b  <  s.  We  consider  three  cases, 
s 

If  dg  >  t,  then  bj.  =  bt  +  1  =  t  +  1,  so  b'  is  not  a  base.  If 

dg  <  d^,  then  for  any  s,  s  =  l,...,n,  b^  <  bg ,  so  B  is  a  base.  Finally 

suppose  d^  <  dg  <  t.  If  s  <  d^  or  s  >  dg,  then  b^  =  bg.  If  dh  <  s  <  dg, 

then  b'  =  b  +1 ;  the  choice  of  t  shows  b'<  s.  Hence  B  is  a  base.  □ 
s  s  s 

The  Lemma  implies  that  the  following  procedure  can  be  used  to 
find  the  best  swap  (g,h)  for  a  base  B  and  red  job  h. 

procedure^D ; 
begin ^ 

1.  find  the  smallest  time  t  >  dh  where  B  is  tight; 

2.  find  the  largest  green  job  g  £  B  with  dq  <  t; 

3.  make  (g,h)  a  swap;  B:  =  B-g  +  h; 
end  D; 

Note  that  time  t  exists,  since  B  is  a  base.  The  given  job  g  exists 
if  there  is  any  swap  for  h,  and  (g,h)  is  a  largest  swap,  by  the  Lemma. 

We  will  find  a  restricted  swap  sequence  by  iterating  D.  We  use 
the  following  data  structure  for  the  base  B:  Take  any  balanced  tree  with 
n  leaves,  e.g.,  a  complete  binary  tree.  Number  the  leaves  from  left  to 
right  as  l,...,n.The  leaves  descending  from  a  node  s  form  an  interval  of 
integers  [£,r] .  Node  s  represents  the  corresponding  time  interval,  [l- l,r). 
Node  s  has  three  data  fields :  G(s)  is  the  largest  green  job  in  B  with  a  deadline 
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in  [£,r],  S(s)  is  an  integer  used  to  compute  slacks;  more  precisely 

for  any  integer  time  t,  if  P  is  the  path  from  leaf  t  to  the  root,  then 

the  slack  of  B  at  t  is  V  S(u).  M(s)  is  the  smallest  sum  5!  S(u), 

UeP  UeP 

where  P  is  a  path  from  s  to  a  leaf  in  [£,r].  (Thus  for  instance  the  root 

has  M-value  0. ) 

Figure  5.2  illustrates  this  data  structure  for  Figure  5.1.  Each 
node  s  is  labelled  with  the  values  G(s),  S(s),  M(s).  Figure  5.2(a) 
shows  the  data  structure  for  the  initial  base  of  all  green  jobs;  Figure 
5.2(b)  shows  it  after  the  first  swap  (7,1)  has  been  made. 

Besides  the  balanced  tree,  the  green  jobs  of  B  are  organized  in 
lists:  For  each  integer  time  t,  1  <  t  <  n,  there  is  a  list  L(t)  of  all 
the  green  jobs  in  B  with  deadline  t;  L(t)  is  sorted  so  that  cost  is  nonincreasing. 

Using  this  data  structure,  it  is  easy  to  implement  D  in  time 
0(logn)  [AHU,pp. 145-152] .  For  example,  the  update  to  B  in  line  3  is 
done  as  follows: 

comment  set  B  to  B  -  g  +  h; 

3.1  remove  g  from  L(dgj; 

if  L(dn)*0  then  G(dn):=  the  first  job  in  L(d_) 

elj>e_  G(dg)  :=0  comment, 0  is  a  dummy  job  with  cost  -  °°; 

3.2  increase  S(d  )  and  M(d  )  by  1; 

y  y 

for  each  node  s  on  the  path  from  leaf  d  to  the  root  do 

A*'®*'* 

beg  i  n_ 

if^s  has  a  right  brother  r  therMncrease  S(r)  and  M(r)  by  1; 
if  s  ^  dn  then  begin 

let  G(s)  be  the  job  with  the  largest  cost  in  {G(s')|s'is 
son  of  s}; 

M(s):=  S(s)  +  min{M(si)|s'  is  a  son  of  s}; 
end  end; 
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3.3 


decrease  S(d^)  and  M(d^)  by  1; 

for  each  node  s  on  the  path  from  leaf  d,  to  the  root  do 
^  r  n 


beqi  n 


if  s  has  a  right  brother  r  then  decrease  s(r)  and  M(r)  by  1; 

am* 

M(s)  :=  S(s)  +  mi n{M( s ' ) | s '  is  a  son  of  s}; 


end; 

AWl 


The  remaining  details  of  procedure  D  are  left  to  the  reader.  Now 
suppose  we  are  given  bases  B  and  R  of  all  green  and  all  red  jobs 
respectively.  A  restricted  swap  sequence  for  B  and  R  can  be  found  by 
sorting  the  jobs  in  B  and  in  R,  constructing  the  data  structure  for  B, 
and  iterating  procedure  D  n  times.  This  gives  the  following  result. 

Lemma  5.4.  A  restricted  swap  sequence  for  two  bases  can  be  found  in 
time  O(nlogn)  and  space  0(n).  □ 


The  complete  algorithm  for  our  intersection  problem  follows 
procedure  A:  First  find  bases  B£  and  Bu  (lines  6-7).  Then  check  q  for 
feasibility  (line  8).  Find  a  restricted  swap  sequence  as  in  line  §,  only 
using  the  procedure  given  above.  As  in  lines  10-11,  form  the  desired 
base  Bby/making  the  q  smallest  swaps.  Finally  construct  a  schedule  for  B 
in  linear  time,  as  follows:  Schedule  jobs  of  B  from  the  first  time  slot 
to  the  last,  always  choosing  the  job  with  smallest  deadline  to  be  executed 
next. 

Theorem  5.1.  For  simple  scheduling  matroids,  a  smallest  schedule  with 
q  red  jobs  can  be  found  in  time  0(m+n  log  n)  and  space  0(m).  □ 


Note  that  our  time  bound  is  the  same  as  the  best  known  method  for 
finding  a  minimum  cost  base.  (The  time  needed  to  do  the  latter  is 
clearly  a  lower  bound  on  the  time  for  our  intersection  problem.)  We 
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can  also  show  that  our  algorithm  is  the  best  possible  implementation  of 
the  swap  sequence  approach.  This  is  done  below  in  Corollary  6.1. 

5.2.  General  Scheduling  Matroi d s . 

Now  we  treat  the  general  scheduling  problem,  where  each  job  j 
has  a  release  time  r^  and  a  deadline  d. .  Again  we  seek  a  smallest  schedule 
with  q  red  jobs. 

We  start  by  normalizing  the  matroi d. 

Lemma  5.5.  Let  M  be  a  general  scheduling  matroid.  Then  there  is  a 
general  scheduling  matroid  M‘ ,  where  M1  has  rank  n'  =  ma x { d j | j  is  a  job 
of  m'},  m'  <  (n')3,  and  a  base  of  m'  is  a  base  of  M.  If  the  jobs  have 
costs,  a  minimum  cost  base  of  M*  is  minimum  for  M.  M '  can  be  found  in 
time  0(m) . 

Proof.  First  we  find  M*  satisfying  all  conditions  except  the  upper  bound 
on  m' .  Initialize  M1  to  M.  Without  loss  of  generality,  for  n'  = 
maxldj | j  is  a  job  of  M ' } ,  n'  >  m';  also  some  job  has  release  time  0. 

Now  do  the  following: 

1.  let  l  =  m  i  n  { 1 1 1  is  an  integer  and  t  >  |  { j  |  job  j  has  rx  t  }  | } ; 

2.  let  h  =  mi n{ t | t  is  an  integer,  t  >  l-l  and  some  job  j  has  r.  =  t} 

u  { n ' } ; 

3.  contract  the  time  interval  [£-l,h),  i.e.,  set  all  deadlines  that  are 
in  [£-l,h)  to  l- 1,  and  decrease  all  release  times  and  deadlines  that 
are  h  or  more  by  h-£+l; 

Note  that  in  line  1  £  is  well-defined  since  the  time  n 1  is  in  the  set. 
In  line  2,  h  >  £,  since  line  1  implies  no  job  has  release  time  l-l. 

Thus  line  3  contracts  a  nonempty  time  interval. 
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Now  we  show  that  any  schedule  can  be  transformed  so  that  it  does  not 
use  the  time  slots  that  are  contracted  in  line  3.  Given  a  schedule  S 

for  M.  let  S'  be  a  schedule  executing  the  same  jobs  as  S  but  as  early 

as  possible;  more  precisely,  if  [t.-l9t.. ),  i  =  l,  k  are  the  times 

when  jobs  are  executed  and  t^  <  t^ - . - <tk ,  then  the  vector  ( ,t2, . . . »tk) 
is  lexicographically  minimum.  Let  u  be  the  highest  time,  u  <  £,  where 
[u-l,u)is  idle  in  S'.  Note  u  exists  by  line  1.  We  claim  u  =  £. 

To  see  this,  suppose  on  the  contrary  that  u  <  £.  The  £- u  jobs 
that  s'  schedules  in  [u ,£)  have  release  times  at  least  u  (else  s' 
could  be  improved).  Line  1  implies  there  are  at  least  u  jobs  with 
release  times  less  than  u.  This  gives  at  least  (£- u)  +  u  =  £  jobs  with 
release  times  less  than  £,  a  contradiction. 

Hence  u  =  l.  This  implies  that  no  job  with  release  time  less 
than  £  is  scheduled  after  time  l  (otherwise  S1  can  be  improved).  Now  it 
is  easy  to  see  from  line  2  that  no  job  is  scheduled  in  [£-l ,h) .  Thus 

the  contraction  in*  line  3  does  not  change  the  bases  of  the  matroid. 

Now  to  i nsure  that  n '  <  m '  ,  repeat  the  above  three  steps  unti 1 
the  desired  condition  holds.  (Note  that  each  repetition  decreases  the 
value  n').  However  in  line  1,  always  consider  only  new  jobs,  i.e., 
if  k  is  the  previous  value  £-1,  the  new  value  of  £  is 
£  =  mi n{ 1 1 1  is  an  integer  and  t-k> | (job  j  has  k<r.<t}|}. 

The  above  procedure  can  be  implemented  in  time  0(m),  if  the  jobs 
are  sorted  by  release  time.  (In  line  3,  it  is  only  necessary  to  modify 
a  job's  release  time  and  deadline  once).  Further,  we  can  keep  track  of 
the  modifications,  so  that  a  schedule  in  M*  can  be  converted  to  a  schedule 
in  M. 

Finally  we  modify  M1  to  achieve  the  upper  bound  on  m'.  For  every 
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nonempty  set  of  the  form  { j | r<r  .<d  .<d} ,  delete  all  but  the  d-r  smallest 

J  J 

jobs  from  M  .  Using  bucket  sorts  and  linear-time  selection,  this  step  is 

0(m) .  □ 

In  the  remainder  of  the  discussion  we  assume  M  has  been  modified  as 
in  the  Lemma. 

The  greedy  algorithm  on  scheduling  matroids  can  be  implemented  in 

p 

time  0(mlog  m  +  n  ).  Lipski  and  Preparata  [LP]  obtain  a  bound  of  0(mn) 
using  matchings  and  augmenting  paths.  Their  method  can  be  modified  to 
achieve  the  above  time  bound.  Here  we  give  a  method  based  on  Glover's 
algorithm  for  matching  convex  graphs  [GI],  also  called  the  earliest  deadline 
rule  for  scheduling  [LF,J]. 

The  earliest  deadline  rule  finds  a  schedule  for  a  given  set  of  jobs 

if  one  exists  [LF,J].  Call  a  job  j  available  for  time  t  if  it  is  not 

scheduled  in  [0,t-l)  and  r.  <  t  <  d..  The  earliest  deadline  rule  is  as 

J  J 

follows : 

for  t:  =  1  to  n  do 

if  some  job  is  available  for  t  then 
in  [t-l,t),  schedule  a  job  that  is  available  for  t  and 
has  smallest  possible  deadline; 

We  call  any  schedule  that  can  be  constructed  by  this  rule  an  earl i est 
deadline  schedule.  By  convention,  during  any  time  interval  [t-l,t)  that 
the  machine  is  idle  we  say  it  is  executing  a  dummy  job  0,  where  dQ  =  n  +  1. 

The  greedy  algorithm  works  by  iterating  the  following  step:  Given 
an  unscheduled  job  x  and  scheduled  jobs  S,  if  S  +  x  can  be  scheduled 
then  add  x  to  S.  (The  jobs  x  are  considered  in  order  of  nondecreasing 
cost) . 
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We  maintain  an  earliest  deadline  schedule  for  S.  To  gain  efficiency, 
we  delete  certain  "tight"  time  intervals,  where  the  schedule  cannot  change. 

A  time  interval  [r , d)  is  tight  if  d-r  =  |{j|j  is  currently  scheduled  and 
r<r.<d.<d}|.  The  following  data  structure  keeps  track  of  time  intervals 
[ t-1 , t )  that  are  not  in  tight  intervals  and  hence  not  deleted. 

The  time  intervals  [t-1 , t )  that  are  not  in  tight  intervals,  where 
t  is  an  integer,  0  <  t  <  n  +  1,  are  maintained  in  a  doubly-linked  list  T 
in  ascending  order.  A  node  p  on  T  has  three  fields.  TIME(p)  is  the 
value  t  for  interval  [ t-1 , t ) ,  SUCC(p)  is  a  pointer  to  the  next  node  on  T, 
if  it  exists,  and  PRED(p)  is  a  pointer  to  the  preceding  node, if  it  exists. 
Thus  TIME(p)  <  TIME(SUCC(p) )  if  SUCC(p)  exists.  (Note  that  the  first  and 
last  nodes  of  T,  with  TIME  fields  0  and  n  +  1,  are  dummies). 

In  addition,  the  times  0,  ...,  n  are  partitioned  into  disjoint 
sets.  For  each  time  t  on  T,  there  is  a  set  =  {s|0<s<n,  and  t  is  the 
lowest  time  in  T  with  s<t}.  These  sets  are  manipulated  by  UNION  and  FIND 
instructions  [AHU],  UNI0N(Ss,St)  merges  set  Sg  into  S ^ ,  thereby  eliminating 
S  .  FIND(s)  has  a  value  that  is  a  pointer  to  a  node  of  T :  if  s  £  S^,  then 
TIME ( FIND( s ) )  =  t. 

The  algorithm  also  uses  a  list  C  of  changes  to  the  current  schedule. 
The  algorithm  attempts  to  construct  an  earliest  deadline  schedule  for 
S  +  x.  All  changes  needed  in  the  current  schedule  are  recorded  in  C. 

If  S  +  x  can  be  scheduled,  then  the  changes  of  C  are  made;  otherwise  they 


are  not  made. 
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procedure  E 


comment  given  is  an  earliest  deadline  schedule  for  a 
set  of  jobs,  and  an  unscheduled  job  x.  E  adds  x  to  the 
earliest  deadline  schedule,  if  this  is  possible.  Otherwise 
it  may  delete  a  tight  interval  from  T; 


begin 

1.  C:  =  0;  u:  =  x;p:  =  FI ND ( r  ) ;  t:  =  r:  =  TIME(p); 

X 

2.  while  u  *  0  and  t  <  d, ,  do 

Aa*a**~  U  ***** 

begin^ 

3.  let  j  be  the  job  scheduled  in  [ t- 1 , t )  commer^  j  may  be  0; 

4.  if  d  <  d .  then 

5.  begin  in  list  C,  schedule  u  in  [t-1 , t) ;  u:  =  j  end; 

6.  p:  =  SUCC(p) ;  t:  =  TIME(p); 


end; 

A'''-'1* 

7.  if  u  =  0  then  update  the  schedule  by  making  the  changes  in  C 


8. 

9. 


else 


begin 


q:  =  PRED(p);  s:  =  TIME(q) ;  r:  =  r-1, 
while  s  >  r  do 


begim 

10.  let  j  be  the  job  scheduled  in  [s-l,s); 

11.  if  r.  <  r  then  r:  =  r.; 

-W*  J  J 

12.  UNION ( s , t) ; 

13.  q:  =  PRED(q);  s;  =  TIME(q) ; 
end  end  end  E; 
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Lemma  5.6.  The  greedy  algorithm,  using  procedure  E,  finds  a  minimum  cost 

?  2 
schedule  in  time  0(m  log  m  +  n  )  and  space  0(m) .  {The  time  is  0(ma(m,n)+n  ) 

if  the  jobs  are  given  in  sorted  order). 

Proof.  Assume  that  x  is  the  next  job  to  be  considered  by  the  greedy  algorithm 
and  that  an  earl iest  deadl i ne  schedul e  has  been  constructed  for  the  correct 

subset  of  jobs  larger  than  x.  Further,  assume  that  list  T  and  sets 
are  correct.  This  means  that  if  s  and  t  are  consecutive  times  on  T,  then 
integers  [s,t)  form  a  set  and  [s,t-l)  is  a  tight  interval.  We  show  that 
E  processes  x  correctly,  i.e.,  an  earliest  deadline  schedule  for  S  +  x 

is  constructed  if  possible;  otherwise  T  and  are  updated  correctly . 

Cl  early  thi s  suffices  to  show  that  the  greedy  al  gori thm  itself  works  correctly. 
We  begin  by  analyzi ng  the  loop  of  1 i nes  1-6.  Let  C  be  the  schedule 

derived  from  the  current  one  by  making  the  changes  in  list  C.  We  show  by 
induction  that  each  time  line  2  is  reached,  one  of  the  following 
alternatives  holds: 

(i)  u  =  0,  in  which  case  Cisan  earliest  deadline  schedule  for  S  +  x. 

(ii)  t>du,  in  which  case  it  is  impossible  to  schedule  al  1  jobs  of  S  +  x. 

(iii)  u*0  and  t<d  ,  in  which  case  C  is  an  earliest  deadline 
schedule  for  S+x  over  the  time  interval  [0,t-l);  the  jobs  remaining  to 
be  scheduled  in  [t-l,n)  are  u  plus  the  jobs  in  [t-l,n)  in  the  current 
schedule. 

For  the  base  case,  u  =  x.  On  the  time  interval  [0,r  )  it  is  clear 

X 

that  C  is  an  earliest  deadline  schedule  for  S+x.  Thus  if  t  =  r  +  1,  then 

X 

alternative  (iii)  holds  at  1 i ne  2.  Otherwise  suppose  [r  ,r  +i)  is  in  a 

X  X 

tight  interval  [s ,t-l)  (for  some  s) .  If  t  >  d  ,  then  x  and  all  jobs  in 

x 

[s,t-l)  have  deadline  <  t-1.  Thus  the  earliest  deadline  rule  does  not 
schedule  one  of  these  jobs ,  and  (ii)  holds .  Otherwise  if  t  <  d  ,  then 

X 
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si  nee  all  jobs  currently  in  [s,t-l)  have  deadline  less  than  d  ,  (iii)  holds. 

For  the  inductive  step,  assume  (iii)  holds  when  line  2  is  reached. 

We  show  that  one  of  (i)— (iii)  holds  after  lines  3-6  are  executed.  It  is  clear 
that  lines  3-5  update  C  and  u  correctly  for  the  Interval  [0 ,t) .  Now  If 
u  =0,  (i)  holds.  Otherwise,  reasoning  as  above,  (ii)  holds  if  t  >  d^ 
and  (iii)  holds  if  t  <  d  .  This  completes  the  induction. 

It  is  easy  to  see  that  E  is  correct  if  the  loop  halts  with  u  =  0. 
Otherwise,  if  u  *  0,  from  (ii)  we  need  only  show  that  lines  8-13  update  the 

schedule  correctly. 

Let  r^  be  the  value  of  r  computed  in  line  8.  Consider  the  jobs 
of  the  original  schedule  in  [>Q,t-l) .  These  jobs  all  have  deadline 

£  t  -  1.  For  let  the  tight  interval  ending  at  t  -  1,  if  any,  be  [s ,  t  -1) 

(for  some  s).  Jobs  in  [s,  t-1)  have  deadline  <  t-1.  Lines  4-5  show 

that  jobs  in  the  remainder  of  [rg,t  -  1 )  have  deadline  <  d^<t. 

Take  4  <  t  -  1  as  high  as  possible  such  that  a  job  with  deadline  >  t  -  1 

(or  job  0)  is  scheduled  in  [4-1,4).  (If  no  such  4  exists,  take  4=0.) 

The  earliest  deadline  rule  implies  all  jobs  in  [4, t-1)  have  release  times 

>  4.  Hence  [4, t-1)  is  tight.  The  preceding  paragraph  shows  4  <  r^.  Now 

take  ft  as  large  as  possible  such  that 

ft  =  min{r0>  u  (r.|  job  j  is  scheduled  in  [/l , t-1 ) } . 

Clearly  ft  >  4  and  [ft, t-1)  is  tight.  Further,  if  [ft' , t ' )  is  tight,  with 

ft'<ft<t  <t-l,  then  [/l  ' ,  t  - 1 )  is  tight. 

It  is  an  easy  matter  to  check  that  the  loop  of  lines  9-13  halts  with 

r  =  fi9  and  further  that  it  updates  T  and  the  sets  for  the  tight  interval 

[ft' ,  t-1 ) .  Thus  the  greedy  algorithm  works  correctly. 

Now  we  estimate  the  time  for  the  greedy  algorithm.  The  initial  sort 

of  m  jobs  by  cost  is  0(mlogm).  The  set  merging  instructions  do  m 

FINDs  (line  1)  and  <  n  UNIONS  (line  12)5  thus  the  time  for  set  merging 

2 

is  0(m a (m,n) ) .  We  show  that  the  remainder  of  the  processing  is  0(n  ). 
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Exactly  n  jobs  get  scheduled.  For  each  of  these  jobs,  lines  1-7  are 

2 

executed  in  0 ( n )  time.  Thus  0(n  )  time  is  spent  on  the  scheduled  jobs. 

Now  we  show  that  0(m)  time  is  spent  on  the  jobs  x  that  do  not  get  scheduled. 

For  these  jobs,  it  is  clear  from  previous  remarks  that  each  of  the  times 
t  processed  in  lines  2-6  it  also  processed  (as  s)  in  lines  9-13.  We 
charge  an  iteration  of  lines  9-13  to  the  time  processed,  s.  Since  there 
is  at  most  one  iteration  for  a  given  s,  the  charge  to  a  given  s  is  0(1). 

The  desired  bound  follows.  □ 

The  techniques  used  in  E  can  be  applied  to  find  a  restricted  swap 

sequence.  Consider  a  given  earliest  deadline  schedule  and  an  unscheduled 

red  job  h.  To  find  a  largest  swap  (g,h)  for  h,  select  d  as  small  as 

possible  and  r  as  large  as  possible  so 

d  =  max{d^}  u  {d.-|  j  is  scheduled  in  [r^,d)}, 

r  =  min{r,}  u  {r.|  j  is  scheduled  in  [r,d)}. 
n  j 

We  claim  that  g  is  a  largest  green  job  scheduled  in  [r,d). 

To  see  this,  first  note  that  [r,d)  is  tight,  as  in  Lemma  5.6.  Thus 
any  valid  swap  has  its  green  job  in  [r,d).  Next  note  that  swap  (g,h)  is 
valid.  For  if  g  is  in  |>h,d),  an  earliest  deadline  schedule  with  g 
swapped  for  h  is  constructed  by  a  procedure  similar  to  E  (as  usual  assume 
we  start  with  an  earliest  deadline  schedule): 

1.  replace  g  by  the  0  job; 

2.  s:  =  rh;  u:  =  h: 

3.  while  u  *  0  dcy 

begi_n_ 

4.  let  j  be  the  first  job  scheduled  in  [s,d)  with  d.>d  ;  let  j  be 

scheduled  in  [t-l,t);  J 

5.  in  [t-l,t)  schedule  u;  u:  =  j;  s:  =  t; 
end; 
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Otherwise  if  g  is  in  [r,r^)s  the  following  algorithm  constructs  an  earliest 
deadline  schedule  with  g  swapped  for  h: 


1 

1 . 

i  _ j-  „  i „_[ _ i , . t i  4 ~  i  ~  \  . 

1  C  U  g  UC  3UIICUUICU  111 

2. 

while  s  <  r,  do 

begij^ 

3. 

let  j  be  the  first  job  scheduled  in 

[s ,d)  with  r.<s-l;  let  j  be 

vJ 

scheduled  in  [t-l,t); 

4. 

in  [s-1 ,s )  schedule  j;  s:  =  t; 

end; 

5. 

add  h  to  the  schedule,  using  lines  2-5 

of  the  previous  algorithm; 

Thus  to  compute  a  restricted  swap  sequence  for  red  jobs  h^, 
hn  and  base  HQ,  iterate  the  following  steps:  For  h . ,  compute  r  and  d  as 
above;  find  the  largest  green  job  g ^  in  [r,d);  find  an  earliest  deadline 
schedule  for  H. ,  as  above.  Since  one  iteration  is  0(n),  the  time  for  the 
restricted  swap  sequence  is  0(n  ). 

Now  we  sketch  the  complete  algorithm  for  our  intersection  problem. 
We  modify  procedure  A.  First  find  a  smallest  base  with  the  minimum 

number  of  red  jobs  (line  6  of  A).  To  do  this  use  the  greedy  algorithm, 
with  all  green  jobs  considered  before  any  red  job.  Similarly  compute 
(line  7).  The  test  of  line  8  is  as  in  A.  The  restricted  swap  sequence 
is  found  as  above.  Then,  as  in  lines  10-11,  the  desired  base  B  is  found. 
Finally,  the  desired  schedule  is  formed.  This  last  step  can  be  done  using 
the  earliest  deadline  rule  in  time  O(nloglogn)  [EKZ]  or  more  efficiently 
in  time  0(n  a  (n,n) )[LP] . 

Theorem  5.2.  For  general  scheduling  matroids,  a  smallest  schedule  with  q 
red  jobs  can  be  found  in  time  0(mlogn  +  n  )  and  space  0(m).  □ 
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A1 though  we  cannot  prove  a  matching  lower  bound  for  this  problem, 
note  that  the  time  is  the  same  as  for  the  greedy  algorithm. 

5.3.  Matching  and  Transversal  Matroids. 

We  close  Section  5  by  briefly  examining  the  general  case  of  matching 
and  transversal  matroids.  First  we  need  some  notation.  Recall  thatn  denotes  the 
number  of  vertices  in  a  base;  m  denotes  the  number  of  vertices  in  the  set 
J  of  vertices  in  the  matroid.  We  need  one  more  parameter  to  specify  the 
size  of  the  matroid:  e  denotes  the  number  of  edges  in  the  graph. 

The  greedy  algorithm  on  matching  matroids  can  be  implemented  in 
time  0(mlogm  +  ne):  The  time  to  sort  the  vertices  is  O(mlogm).  To 
test  if  a  vertex  can  be  added  to  an  independent  set,  we  use  the  method 
of  augmenting  paths.  Using  the  techniques  of  [Ga2,KM]  for  cardinality 
matching,  the  test  can  be  done  in  time  0(e).  As  in  the  general  scheduling 
algorithm,  we  delete  vertices  that  are  reached  in  unsuccessful  tests 
(these  are  "Hungarian  vertices"  [Ga2]) .  This  makes  the  unsuccessful  tests 
use  a  total  of  0(m+e)  time,  while  the  successful  ones  use  0(n  e) .  The 
time  bound  follows. 

Again  using  augmenting  paths,  a  restricted  swap  sequence  can  be 
computed  from  the  definition,  in  time  0(ne).  This  completes  a  sketch  of 
the  following  result: 

Theorem  5.3.  For  matching  and  transversal  matroids,  a  smallest  base  with 
q  red  elements  can  be  found  in  time  0(mlogm  +  ne)  and  space  0(m+e).  □ 

As  usual  our  time  bound  matches  the  best  known  bound  for  finding 


a  minimum  cost  base. 
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6.  Partition  Matroids. 

This  section  discusses  our  intersection  problem  on  partition  matroids. 
First  we  prove  an  Q(nlogn)  lower  bound  for  the  swap  sequence  approach.  This 
bound  extends  to  scheduling  and  matching  matroids.  Then  we  give  an  0(m) 
algorithm  for  partition  matroids. 

We  start  by  defining  the  matroids.  A  partition  matroid  derives  from 
a  set  E  of  m  elements  that  is  partitioned  into  disjoint  subsets 
E. ,  i=l,  . ..,  b,  and  positive  integers  n^. ,  i=l,  . ..,  b.  The  elements  of 

the  matroid  are  those  of  E;  a  base  is  a  set  containing  exactly  n .  elements 

I 

of  E.,  1=1.  ....  b.  [LI ,  p.  272]  If  b  =  1.  the  matroid  is  called  uniform. 

The  greedy  algorithm  on  partition  matroids  finds  a  minimum  cost  base 
in  time  O(mlogm).  This  is  easily  improved  to  0(m)  by  using  a  linear¬ 
time  selection  algorithm. 

A  restricted  swap  sequence  can  be  constructed  from  the  definition 
in  time  O(nlogn).  Thus  following  procedure  A  our  intersection  problem 

can  be  solved  in  time  0(m+nlogn).  Before  improving  this,  we  show  that 
it  is  the  best  one  can  do  using  the  swap  sequence  approach.  Recall  from 
Section  4  that  sort  (n)  is  the  time  needed  for  an  algorithm  to  sort  n 

numbers.  Now  consider  an  algorithm  on  partition  matroids.  Say  it  "uses 
the  swap  sequence  approach"  if,  given  a  uniform  matroid  consisting  of 
two  bases,  one  green  and  the  other  red,  the  algorithm  finds  the  swaps 
in  a  swap  sequence  for  the  two  bases.  (As  usual,  the  algorithm  need 
not  find  the  order  of  the  swaps.) 

Lemma  6.1.  Any  algorithm  on  partition  matroids  that  uses  the  swap 
sequence  approach  requires  time  Q(sort(n)). 

Proof.  As  in  Lemma  4.1,  it  suffices  to  give  a  procedure  that  sorts  n 


numbers  by  calling  the  algorithm  and  doing  0(n)  extra  processing. 
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Consider  n  arbitrary  numbers  x^,  . ..,  xn*  The  corresponding  uniform 
matroid  consists  of  n  green  elements  costing  1,  . ..,  n  and  n  red  elements 

rn<;tinn  y  y  Tf  thp  niimharc:  in  nnndprroacinn  nnHon  sfo  \/  V  , 

j  -•]_  -  -  •  -  >  '*n-  -  ■  .  '  y 

it  is  easy  to  see  that  the  swap  sequence  is  (n,y^),  (n-l^),  ...»  (l9yn). 

The  procedure  forms  the  matroid  and  calls  the  algorithm.  Then  it 
computes  the  array  S,  where  S(i)  is  the  element  paired  with  i  as  a  swap. 

Last,  it  outputs  S(n),  S(n-l),  S ( 1 )  as  the  sorted  order. 

Only  0(n)  time  before  and  after  the  algorithm  is  used.  □ 

Note  that  a  uniform  matroid  is  a  special  case  of  a  simple  scheduling 
matroid  (where  all  jobs  have  deadline  n).  Thus  for  the  obvious  definition 
of  "swap  sequence  approach"  the  following  corollary  holds: 

Corollary  6.1.  Any  algorithm  on  simple  scheduling  matroids  that  uses  the 
swap  sequence  approach  requires  time  ft(sort(n)).  The  same  holds  for 
general  scheduling,  transversal  and  matching  matroids. 

Now  we  present  the  linear  algorithm.  It  is  a  straightforward  adaptation 

+4i  I 

of  an  efficient  algorithm  for  selecting  a  q  'smallest  element  of  a 
multiset  X  +  Y.  Here  X  and  Y  are  sets  of  n  real  numbers  and  X  +  Y  is 
the  multiset  {x+y ] xeX,y£Y}.  We  use  the  selection  algorithm  of 
Jefferson,  Shamos  and  Tarjan  [S,pp. 256-8], which  runs  in  time  O(nlogn). 

The  well-known  algorithm  of  [JM]  also  uses  time  0(n  log  n)  but  apparently 
does  not  adapt  to  our  problem. 

In  a  partition  matroid  it  is  easy  to  describe  a  swap  sequence  for 

B  and  R:  Choose  some  consistent  rule  to  break  ties  in  cost.  Then  the 

t  h 

swaps  in  a  swap  sequence  are  t.j  { (e  .  ,f . )  |  l<j<n . ,  e  .  is  the  j  largest 

C  J  J  ‘  J 

i 

t  h 

green  element  of  n  B  and  f.  is  the  j  smallest  red  element  of  E.  -  B, 
if  both  exist}.  A  swap  sequence  is  formed  by  sorting  this  set  in  order 


of  nondecreasing  cost. 
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To  get  a  linear  algorithm  we  cannot  find  the  swaps  individually. 
Instead  we  repeatedly  select  a  small  number  of  swaps,  all  known  to  be 
either  among  the  n  smallest  swaps  or  not  among  them.  In  the  first  case 
the  swaps  are  executed;  in  the  second  case  the  elements  of  the  swaps 


are  deleted.  More  precisely,  if  q  <|j  ,  we  find  swaps  that  are  larger 
than  the  median  (i.e.,  the  j  th  smallest  swap);  these  swaps  are  deleted. 
If  q  >  ~  we  find  swaps  that  are  smaller  than  the  median  and  execute  them. 

Now  we  sketch  the  complete  algorithm.  It  follows  lines  6-9  of 


procedure  A.  First  we  find  B^,  a  smallest  base  with  as  few  red  elements 
as  possible.  Note  that  for  each  block  n  E^.  consists  of  the  n^ 

smallest  green  elements,  or  if  only  h  <  n .  green  elements  exist,  all 
green  elements  plus  the  n.-h  smallest  red  elements.  We  can  find  in 
Q(m)  time  using  linear-time  selection.  The  desired  base  B  is  initialized 


to  B^. 

Similarly  we  find  B  in  0(m)  time  and  check  q  for  feasibility. 

u 

Then  we  pass  to  the  matroid  M  -  13^  u  B^ ) / ( B^nB u ) :  Elements  not  in 

B„  u  B  are  deleted.  An  element  e  e  Bn  n  B,,  is  contracted  as  follows. 

I  u  z  u 

e  is  deleted  from  the  matroid;  n  and  n. ,  where  e  e  E^. ,  are  decreased 
by  1;  if  e  is  red,  q  is  decreased  by  1.  After  the  matroid  is  modified, 


there  are  2n  elements,  forming  bases  of  all  green  and  all  red  elements. 

The  last  step  is  finding  the  q  smallest  swaps  (in  the  reduced  matroid), 


and  executing  them  on  B.  This  gives  the  desired  base.  The  procedure 


for  this,  outlined  above,  is  now  stated  in  precise  form.  For  conciseness, 
we  treat  the  two  cases  q  < 

n  i 

second  case  are  always  given  in  parentheses,  as  in  q  <  (>)  •  In  the 

.  t  h 

algorithm  below,  recall  that  for  a  block  E. ,  the  j  smallest  swap 


J-J  and  q  >  j  together:  The  changes  for  the 
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th  th 

consists  of  the  j  largest  green  element  of  E.  n  and  the  j  smallest 

red  element  of  E.-B^. 


pro£edure^  F ; 
begi^ 

1.  while  q  >  0  do 


begi  n 


let  q  <  (>) 


for  each  nonempty  block  E.  do 

J  &***< 

begi  n 

V  =PKI;  =  l^J+1; 

let  (e.,f. )  be  the  k^h  smallest  (largest)  swap  for  ; 


let  ( e  9 fp)  be  the  smallest  (largest)  swap  such  that  for 
I  =  {i|c(ei,fi)  <  (2)  c(ep,fp)}  I  ki  >  [j-J  ; 
for  each  nonempty  block  E.  do 
if  c(e. ,f. )  >  (<)  c  (e  ,f  )  then 

l|  LI  LJ 

begin 


9. 

let  ((e  .  ,f  .)  |  j  £  J}  be 

J  vJ 

10. 

for  j  e  J  do  delete  e. 

AM^A  AAA-  J 

11. 

>  [ij  hhl 

begin 

12. 

for  j  e  J  do  B:  =  B 

13. 

q:  =  q  -  h . ; 

end; 

14. 

ni :  =  ni-hi;  n:  =  n-hi 

J  J 


end  end  end  F; 

^jVH/ 
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To  insure  the  algorithm  works  correctly  we  must  break  ties 

consistently.  Assume  all  elements  have  been  indexed.  If  two  swaps ^ 

have  equal  cost,  we  say  the  one  whose  red  element  has  smaller  index  is 
smaller. 


Theorem  6.1.  For  partition  matroids,  a  smallest  base  with  q  red  elements 
can  be  found  in  0(m)  time  and  space. 

Proof.  From  the  remarks  above,  it  suffices  to  show  that  procedure  F  is 

correct  and  uses  0 ( n )  time  and  space. 

We  start  with  correctness.  First  note  that  in  line  6,  the  swap 
b 

(e  >f  )  exists,  since  J  ki  > 


P  P  i  =  l 

Now  suppose  q  < 


"3 

.  S  n 

4  n 

£ 

L2_ 

.  The  number  of  swaps  with  cost  <  c(e  ,fp) 


is  >  |Jjr j  >  by  line  6.  Thus  any  swap  with  cost  >  c(ep,fp)  is  not  among  the 

t  h 

q  smallest,  and  can  be  deleted.  Note  that  the  h.  largest  swap  is  the 


n.  +  1 


h. 

i 


3 

7T  n- 

4  i 


th 


smallest.  Thus  all  swaps  deleted  in  lines  7-14 


have  cost  >  c(e^.  9f.. )  >  c(e  ,fp)  as  desired 


Next  suppose  q  >|J- 


Reasoning  as  above,  the  number  of  swaps 

with  cost  >  c(e  ,f  )  is  >  1  .  Thus  any  swap  with  cost  <  c(e  ,f  )  can 

P  P  Uh  <. m  "  P  P 

be  executed,  and  lines  7-14  execute  the  correct  swaps. 

Hence  each  iteration  of  F  deletes  or  executes  the  correct  swaps. 
Since  n  always  decreases,  eventually  the  loop  halts  with  B  as  the 
desired  base. 

Now  we  discuss  the  timing.  We  start  by  observing  that  one 
iteration  of  lines  2-14  is  0(n)  (where  n  is  its  current  value,  i.e., 
the  number  of  elements  currently  in  a  base):  First  note  that  we  maintain 
a  list  of  nonempty  blocks  E.  for  use  in  lines  3,  6  and  7.  Let  b  be 
the  number  of  blocks  in  this  list.  Clearly  b  <  n.  Line  5  uses  a 


-63- 


linear-time  selection  algorithm  [BFPRT,SPP].  This  makes  lines  3-5 
0(b+n)  =  0(n).  Line  6  is  0(b)  =  0(n)  if  a  linear-time  weighted  median 
algorithm  [JMl  is  used.  Similarly  lines  7-14  are  0(n). 

n 

Next  we  show  that  each  iteration  of  lines  2-14  reduces  n  by  at  least  yg-. 


In  a  given  iteration,  let  D  index  the  blocks  where  deletions  are  made, 
i.e.,  D  =  {i I  in  line  8,  c(e. ,f.)  >  (<)  c(en,f  )}.  Jhen  n  decreases  by 

11  P  r 

T  h.  >  y  ni  .  Note  that  V  k.  <  1  5- 1  by  1 ine  6.  Thus 
i  eD  1  i  eD  T  '  l7D  1  2j 


I  ni  >  I 

ieD  1  ieD 


i  ki  -  i  ^  >-  i 

i=l  iVD  1  4 


n  =  n 

2  4* 


n 

These  Inequalities  imply  n  decreases  by  at  least yg-  as  desired.. 


Finally  let  nQ  be  the  value  of  n  in  the  initial  iteration  of 


lines  2-14.  In  the  jth  iteration  n  <  (rl)Jn0.  Since  J  = 


15  N  j, 


15J, 


j=0 


16  o 


0(nQ),  we  see  that  the  total  time  is  0(nQ)  as  desired.  □ 
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7.  Root-Constrained  Directed  Spanning  Trees. 

This  section  gives  an  algorithm  that  finds  a  smallest  directed 

spanning  tree  rooted  at  a  given  vertex  v ,  such  thau  v  naS  a  yivcn  ucyi  <=c 
q.  The  time  is  0(min(m  log  n,  n^)) ,  the  same  as  for  the  unconstrained 
problem.  First  we  review  the  unconstrained  minimum  directed  spanning 
tree  algorithm.  Then  we  present  our  algorithm.  Finally  we  show  a  lower 

bound  on  the  efficiency  of  any  algorithm  using  our  approach. 


Let  G  be  a  directed  graph  with  n  vertices  and 
m  edges .  An  edge  e  =  (x,y) ,  directed  from  x  to  y ,  has  head  y  and  tail  x;  we  also 
use  the  notation  h(e)  =  y ,  t(e)  =  x.  As  usual  when  we  contract  a  set 

of  edges  C  to  form  G/C,  we  designate  vertices  and  edges  by  their  names 
in  G.  If  e  is  an  edge  of  both  G  and  G/C,  we  write  h(e,G)  or  h(e,G/C) 
when  it  is  necessary  to  specify  the  graph  that  e  is  considered  to  be  in. 

All  paths  and  cycles  we  consider  are  directed.  A  directed  spanning 
tree  rooted  at  v  is  a  set  of  edges  such  that  v  is  the  head  of  no  edge,  and 
every  other  vertex  w  is  the  head  of  exactly  one  edge  and  is  on  a  path 
from  v  to  w.  (This  is  also  called  an  outtree;  our  methods  easily  adapt 
to  intrees.)  We  sometimes  abbreviate  "directed  spanning  tree  rooted  at  v" 
to  "spanning  tree".  Since  we  are  only  concerned  with  trees  rooted  at  v, 
we  can  assume  that  the  gi ven  graph  G  has  no  edges  di rected  to  v. 

This  implies  that  v  is  not  in  any  directed  cycle. 

Now  we  review  Karp's  derivation  of  the  minimum  directed  spanning 
tree  algorithm  [ K] .  We  specialize  the  derivation  from  the  general  case 
of  branchings  to  the  case  of  directed  spanning  trees  rooted  at  v.  An 
edge  is  called  cri tical  if  no  edge  with  the  same  head  has  smaller  cost. 

A  critical  cycle  (spanning  tree)  is  a  cycle  (spanning  tree)  with  all 
edges  critical,  if  a  critical  spanning  tree  exists  it  is  a 
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minimum  spanning  tree.  However,  the  critical  edges  do  not  necessarily 
give  a  spanning  tree:  a  subgraph  composed  of  one  critical  edge  directed 
to  each  vertex  except  v  may  contain  cycles. 

Let  C  be  a  critical  cycle.  The  reduced  graph  for  C  is  the  graph 
G/C,  with  the  cost  function  of  G  modified  as  follows:  If  g  is  an  edge  of 
G  with  t(g)  i  C  and  h(g)  e  C  and  g'  is  the  edge  of  C  with  h(g')  =  h(g), 
then  in  the  reduced  graph  g  costs  c(g)  -  c(g').  All  other  edges  of  G/C 
have  their  cost  unchanged. 

The  following  result,  proved  in  [K],  is  central. 

Lemma  7.1.  Let  C  be  a  critical  cycle.  In  the  reduced  graph  for  C,  let 
T  be  a  minimum  spanning  tree  rooted  at  v.  Let  e  be  the  edge  of  T  with 
h(e,G/C)  =  C.  Let  e'  be  the  edge  of  C  with  h(e',G)  =  h(e,G).  Then  in 

G,  T  +  C-e'  is  a  minimum  spanning  tree  rooted  at  v.  □ 

The  result  implies  an  efficient  algorithm  for  a  minimum  directed 
spanning  tree,  due  to  Edmonds  [E],  The  algorithm  works  in  two  phases. 
Phase  I  finds  critical  edges  and  places  them  in  a  set  F.  In  the  process 
it  finds  and  reduces  critical  cycles  C.  Phase  I  halts  with  the  edges 
of  the  desired  tree  contained  in  F  and  the  reduced  cycles  C.  Phase  II 
extracts  the  desired  tree  from  F  and  the  C's,  using  Lemma  7.1.  It  halts 
with  F  as  the  desired  tree. 


*  e  exists  since  by  assumption  G  has  no  edges  directed  to  v,  whence  v  /  C. 


-66- 


Phase  I: 

1.  F:  =  0; 

2.  while  some  vertex  w  *  v  in  the  current  graph  is  not  the  head  of 

an  edge  in  F  do 

begin 

3.  let  e  be  a  smallest  edge  of  the  current  graph  with  h(e)  =  w; 

4.  if  t(e)  does  not  descend  from  h(c)  in  F  then  add  e  to  F 

■A/Hf  AVvtA» 

else  begin 

5.  let  C  be  the  (critical)  cycle  formed  by  e  and  the  path  from 

h(e)  to  t(e)  in  F; 

6.  reduce  C; 

y  p  •  —  p  q  • 

end  end; 

Phase  II; 

8.  while  the  graph  is  a  reduced  graph  G/C  for  some  cycle  C  do 

begin^ 

9.  let  e  be  the  edge  of  F  with  h(e,G/C)  =  C; 

10.  let  e‘  be  the  edge  of  C  with  h(e‘ ,G)  =  n(e,G); 

11.  F:=F+C-e; 
end; 

This  algorithm,  with  the  correct  choice  of  data  structures,  is 
0(min(m  log  n,  n2))  [CFM,T] . 

Now  we  consider  the  root-constrai ned  directed  spanning  tree  problem. 
Given  is  a  directed  graph  G.  Each  edge  has  a  real-valued  cost  and  a  color; 
the  green  edges  are  directed  from  v,  and  the  remaining  edges  are  red  (red 
edges  may  be  directed  from  v).  We  seek  a  smallest  spanning  tree  rooted  at  v 
with  exactly  q  green  edges. 
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Unlike  the  case  of  undirected  spanni ng  trees ,  the  directed  spanning  trees 
do  not  form  a  matroid.  Instead  they  are  the  intersection  of  a  graphic  and  a 

partition  matroid.  Because  of  this,  the  Augmentation  Theorem  fails  for 
directed  spanning  trees.  This  is  illustrated  by  the  directed  graph  in 
Figure  7.1.  Figure  7.2  gives  the  optimal  trees  T^  and  T^  (with  2,  3,  and 
4  red  edges,  respectively),  all  of  which  are  unique.  These  trees  do  not 
differ  by  a  simple  swap  of  edges. 

To  overcome  this  difficulty  we  broaden  the  notion  of  swap  so  that 
several  swaps  may  be  used  to  derive  one  tree  from  another.  More  precisely, 
if  T  is  a  directed  spanning  tree  rooted  at  v,  (e,f)  is  a  swap  for  T  if 

edge  e  e  T  is  green,  f  £  T  is  red,  and  h(e)  =  h(f).  As  usual  the  cost  of 

(e,f)  is  c(f)  -  c(e). 

(e,f)  is  a  complete  swap  if  T-e  +  f  is  a  spanning  tree.  This  holds 
if  and  only  if  t(f)  does  not  descend  from  h(f)  in  T.  Otherwise,  if  T-e+f 
is  not  a  spanning  tree,  (e,f)  is  an  incomplete  swap. 

Now  we  give  our  algorithm  for  the  root-constrained  directed  spanning 
tree  problem.  Let  the  given  cost  function  be  c.  Let  pi  =  1  +  min { |c(e)  -  c(f)  | 

e  is  green,  f  is  red,  h(e)  =  h(f)}.  Define  a  new  cost  function  c'  as  follows: 

c'(e)  =  c(e)  +  pi,  if  e  is  green, 

=  c(e) ,  if  e  is  red. 

Using  cost  c',  execute  Phase  I  of  Edmonds'  algorithm.  This  finds  a  critical 
tree  F  in  a  reduced  graph;  F  contains  all  the  green  edges,  by  choice  of  c'. 

Next  execute  procedure  G,  given  below,  to  make  swaps  until  F 
contains  q  green  edges.  Finally,  execute  Phase  II  to  derive  the 
tree  correspond! ng  to  F  in  the  original  graph. 
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2. 


3. 

4. 

5. 

6. 

7. 


procedure  G; 


begin^ 

while  F  has  more  than  q  green  edges  do^ 

if  no  swap  exists  then  halt  comment  the  desired  tree  does 
not  exist; 

else  begnji^ 

let  (e,f)  be  a  smallest  swap; 

if  t(f)  does  not  descend  from  h(f)  in  F  then  F :  =F-e  +  f 
else  begin 

let  C  be  the  cycle  formed  by  f  and  the  path  from 
h(f)  to  t(f)  in  F; 

reduce  C; 

F:  =  F  -  C; 


end  end  end  G; 


Suppose  G  is  called  on  the  graph  of  Figure  7.1  with  the  cost  function 
shown  and  q  =  2.  Figure  7.3  illustrates  the  swaps  done  by  G.  An  incomplete 
swap  is  illustrated  by  showing  the  reduced  graph;  a  complete  swap  is 
illustrated  by  showing  the  new  tree  F  (in  the  reduced  graph). 

Now  we  show  that  the  algorithm  is ' correct.’  The  desired  tree  remains  the 
same  if  the  given  cost  function  c  is  changed  by  adding  a  constant  to  all 
green  edges.  Call  such  a  function  d_.  We  will  exhibit  a  d_  such  that  the 
following  is  true: 

(1)  Every  time  the  algorithm  adds  a  red  edge  to  F,  it  is  critical 

(for  d) . 

(2)  Every  reduction  is  made  for  a  critical  cycle  (for  d). 

(3)  Every  green  edge  in  F  at  the  end  of  G  is  critical  (for  d)  in 

the  final  graph. 

Using  Lemma  7.1,  it  is  easy  to  show  that  these  conditions  imply 
the  algorithm  finds  a  minimum  directed  spanning  tree  rooted  at  v  for  the 
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cost  function  d.  Also  it  is  clear  that  the  algorithm  finds  a  tree  with 
q  green  edges.  Together  these  properties  show  the  algorithm  is  correct. 

Now  consider  the  first  step  of  the  algorithm.  Phase  I.  d,  c,  and 
c'  are  identical  on  the  red  edges.  Thus  the  first  step 

satisfies  (1)  -  (3).  Hence  it  remains  only  to  show  (1)  -  (3)  for  procedure  G 
We  begin  by  introducing  some  notation.  If  b  is  any  cost  function, 
call  an  edge  b-critical  if  it  is  critical  when  b  is  the  cost  function; 
similarly  for  a  tree  or  cycle.  Let  p  be  the  number  of  iterations  of  the 

loop  of  lines  1-8  of  G.  Let  the  swaps  done  by  G  be  (e. ,f. ),  i  =  1,  ...,p. 

For  i  =0,  1,  ...,  p  let  G.  be  the  graph  at  the  end  of  the  iteration; 
let  c-j  be  the  cost  function  at  the  end  of  the  i^  iteration;  if  swap  (e. ,f. ) 
is  incomplete,  let  C.  be  the  critical  cycle  that  is  reduced.  Hence  c.  is 

the  cost  function  in  G.. :  (e^  ,f  .)  is  a  smallest  swap  for  C-j_p  if  C-j  exists 

then  G^  =  G.  ^/Cj.  (Although  cycles  are  indexed  from  1  to  p,  C..  exists 
only  for  incomplete  swaps  (e^ ,f. )). 

We  start  by  showing  the  red  edges  of  F  are  critical  with  respect 
to  other  red  edges. 

Lemma  7.2.  In  graph  G. ,  let  f  and  g  be  red  edges,  with  feF,  g^F,  and 
h(f )  =  h(g) .  Then  c^g)  >  c^f). 

Proof.  The  proof  is  by  induction  on  i .  For  i  =  0  the  Lemma  holds  because 
Phase  I  satisfies  (1)  -  (2)  (and  d  =  c  on  red  edges). 

For  the  inductive  step,  suppose  the  Lemma  holds  for  G_._^  and 
consider  swap  (e. ,f.).  If  this  swap  is  complete,  line  4  adds  f..  to  F.  The 
Lemma  holds  for  f. ,  since  (e. ,f. )  is  a  smallest  swap.  It  continues  to  hold 
for  other  red  edges  feF.  Hence  it  holds  for  all  red  edges  f. 

Next  suppose  (e. ,f. )  is  incomplete.  In  G. ,  if  f  is  a  red  edge  of 
F  then  h(f,G. )  *  C.,  since  h  ( e  -  ,G. )  =  .  By  induction 

the  Lemma  holds  for  f.  □ 
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In  Figure  7.3  the  cost  of  swaps  is  increasing.  This  illustrates 
the  following  result. 

Lemma  7.3.  The  cost  of  swaps  i s  nondecreasi ng ,  i . e . ,  for  i  =  1 ,  . . . ,  p  -  1 , 

ci+l  (ei+ljfi+l)  "  ci-l^ei,fi';* 

Proof.  First  suppose  (e. 9f. )  is  a  complete  swap.  Line  4  does  not  create 

new  swaps,  nor  does  it  change  the  cost  of  existing  swaps.  The  desired 

inequality  follows,  since  (e. ,f. )  has  smallest  cost. 

Next  suppose  (e.,f.)  is  incomplete.  If  (e.+1,f.+1)  is  valid  before 

(e..,f. }  is  executed,  its  cost  is  not  changed  by  lines  5-7  (even  if 

h(f-,i,G.  1  eC.).  Again  the  desired  inequality  follows.  Thus  suppose 
i+l  i-l  i 

( ei+1 ,f i+1 )  is  not  valid  before  (e^ ,f ^ ) -  It  is  easy  to  see  that  both 
h(e . , ,  ,G .  , ),  h(f.,,,G.  J  e  C . .  Since  only  one  green  edge  of  F  has  its 

i+i  i-l  1+i  i-i  i 

head  in  C.,  e.+1  =  .  So  when  C.  is  reduced  e!+1  =  fy  Also 

ci-l(fi+l>  “  ci-l(fi+l)  by  Lemma  7-2’  Thus  cFfi+P  =  Ci-l(fi+F  ' 

ci-l(fi+l)  a  °>  and  ci(ei+l^  =  So  c1(e1+l*f1+l)  * 


ci-l(fi) 


(e. ),  which  is  the  desired  inequality.  □ 


Lemma  7.3  is  the  analog  of  Corollary  3.3  for  matroids.  As  with  the 

latter  result,  Lemma  7.3  implies  that  the  inequality  version  of  our  problem 
(i.e.,  finding  a  smallest  spanning  tree  where  the  root  has  degree  at  least  or 

at  most  q)  can  be  solved  as  efficiently  as  the  equality  version. 

Now  we  derive  the  analog  of  Lemma  7.2  for  green  edges. 

Lemma  7.4.  In  graph  G. ,  let  e  and  g  be  green  edges  with  e  e  F,  g  i  F, 
and  h(e)  =  h(g).  Then  c^(g)  >  c.(e). 

Proof.  In  G.,  if  h(e)  =  h(g)  and  c.(g)  >  c.(e),  then  c . ( g)  >  c.(e)  for 

l  *  Jo 

all  j  >  i.  This  is  true  because  any  reduction  after  the  i^h  iteration 
decreases  the  cost  of  g  and  e  by  the  same  amount.  Now  it  is  easy  to  see 
that  the  Lemma  follows  from  this  special  case:  g  =  e^  in  a  complete  swap 
(e^,f^) ;  C. ,  i  >  k,  is  the  first  cycle  to  be  reduced  that  contains  h(g,G^); 
and  e  =  e.  in  the  incomplete  swap  (e.  ). 
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Lemma  7.3  shows  cj_i(ej  >f-j )  -  ck-l^eksfk^'  Thus  ci^  =  c i - 1  ^ e i  ^  “ 

ci-l^fi>  =  -  c1-l(VV  s  -  ck-l(ek’fk}  =  ck-l(9}  "  ck-l(fk)-  Note  that  ck-l(g) 

c.  n  ( g )  and  c,,  =  cn.  (f,^)  9  by  the  hypotheses  of  the  special  case. 

Hence  c.(e)  >  c._^(g)  -  c^ _ ^ ( f ^ )  =  c^ ( g )  as  desired.  □ 

Now  we  show  that  the  algorithm  satisfies  properties  ( 1 ) - ( 3)  with 
respect  to  the  cost  function  d  defined  as  follows: 

d(g)  =  c(g)  +  cp_1(ep,fp),  if  g  is  green, 

=  c(g)  ,  if  g  is  red. 

Define  similar  functions  d.,  i  =  0,  ...,  p  as  follows: 

d i ( g )  =  c.(g)  +  Cp_1(ep,fp),  if  g  is  green, 

=  c(g),  ,  if  g  is  red. 

Suppose  the  original  cost  function  c  is  changed  to  d.  It  is  easy  to  see 
that  dQ  is  the  cost  function  at  the  end  of  Phase  I,  and  d.. ,  i  =  l,...,p 
is  the  cost  function  at  the  end  of  the  i^h  iteration.  This  follows  by 
induction.  (Observe  that  in  a  reduction  if  c(g)  changes  to  c(g)  -  c(g  ), 
then  g'  is  in  the  cycle,  hence  red,  so  c(g')  =  d(g‘).) 

Also  define  R. ,  i=0,  ...,p  as  the  forest  of  red  edges  in  G..  that 
are  in  F.  Now  it  is  easy  to  see  that  Lemma  7.5  below  is  a  statement  of 
properties  (1)  -  (2),  and  Lemma  7.6  is  property  (3). 

Lemma  7.5.  For  i=0,...,p,  R.  is  d. -critical  in  G. .  Further  for  i  =  1, 

...,p,  if  swap  (e.,f.)  is  incomplete  and  contracts  cycle  ,  then  C..  is 

d.  --critical, 
i  -I 

Proof.  We  use  induction  on  i.  For  the  base  case,  note  that  F  is  initially 
CQ-cri tical  and  contains  all  the  green  edges.  Since  Cq  and  dg  are  identical 

on  red  edges,  RQ  is  dg-cri tical  in  GQ. 

Now  suppose  1  <  i  <  p.  The  ith  iteration  does  swap  (e.,f.).  We 

I  I 

start  by  observing  that  +  f..  is  d^-cri tical.  R.  ^  is 
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d.  -i -critical  by  induction.  Consider  edge  f . .  If  f  is  a  red  edge  with 

h(f)  =  h ( f . ) ,  then  c^^f)  >  c . _ ^ ( f - )  by  the  choice  of  (e.  ,f  . ).  Thus 
d.  ,(f)  >  d_.  ,(f_. )  as  desired.  Next  suppose  e  is  a  green  edqe  with 

i-i  i -i  i  -  - 

h(e)  =  h(f . ) .  Lemma  7.4  shows  c._^(e)  >  c . (e^ ) .  Lemma  7.3  shows 

vdvV  s  ci-i(ei’fi)-  Thus  di-i(e)  =  ci-i(e)  +  vfvV  s 

ci -].(ei )  +  c i _ i ( e i  jfi )  =  ct* _i (f-j )  =  d-j _i(fi )  as  desired. 

Now  suppose  (e .  ,f . )  is  a  complete  swap.  Then  R.  =  R.  n  +  f . ,  and 

II  1  I  “  I  1 

d-j  ~  (S1*nce  =  cj„ j )  *  Thus  the  above  paragraph  shows  R.  is 

d. -critical,  as  desired. 

Next  suppose  (e. ,f. )  is  incomplete.  The  contracted  cycle  C .  is 
11  1 

contained  in  R^  ^  +  f.  and  thus  is  cL^-cri tical .  It  remains  to  show  that 

R..  is  d ..-critical .  An  edge  f  €  R^.  is  in  R.  ^  and  does  not  have  its  head 

in  C.  (since  h(e.,G.  ,)  e  C.).  Hence  no  costs  are  modified  at  vertex 
i  ii-l  i 

h(f*Gi-i)  anc*  f  remains  critical  in  G. .  □ 

Lemma  7.6.  At  the  end  of  procedure  G,  every  green  edge  of  F  is  dp-critical 

in  G  . 

P 

Proof.  Consider  a  green  edge  e  e  F  in  G  .  If  g  is  a  green  edge  with 

P 

h(g,Gp)  =  h(e,G  ) ,  then  cp(g)  >  cp(e)  by  Lemma  7.4,  whence  dp(g)  >  dp(e). 

If  f  is  a  red  edge  with  h(f,G  )  =  h(e,G  ),  then  (e,f)  is  a  swap  in  Gp  =  Gp  ^ 

Since  it  costs  no  less  than  (e  ,f  ),  c  (e,f)  >  c  (e  ,f  ).  Adding 

p  p  p  p  p  p  3 

cp(e)  to  both  sides  gives  d  (f)  >  d  (e).  □ 

Since  properties  (1)  -  (3)  are  established,  the  algorithm  is 
correct.  (It  is  easy  to  see  that  if  procedure  G  halts  in  line  2, 
there  is  no  spanning  tree  rooted  at  v  with  q  green  edges.) 

It  is  not  difficult  to  implement  the  algorithm  in  time 
0(min(m  log  n,  n2))  using  the  data  structures  of  [T ,CFM] .  This  involves 
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using  one  UNION-FIND  structure  to  represent  vertices  of  the  reduced  graph, 
another  to  represent  trees  in  the  forest  F,  and  priority  queues  to  keep 
track  of  the  costs  of  all  edges  with  a  given  head  [T] .  The  cycles  are 
represented  by  a  tree  structure  [CFM] .  In  addition,  procedure  G  uses  a 
priority  queue  of  swaps  (for  line  3). 

Theorem  7.1.  A  smallest  root-constrained  directed  spanning  tree  can  be 

o 

found  in  time  0(min(m  log  n,  n  ))  and  space  0(m+n) . 

Proof,  Correctness  follows  from  Lemmas  7,1-6  and  the  accompanying  discussion. 
Further  details  concerning  the  time  bound  are  left  to  the  reader.  □ 

We  conclude  this  section  with  a  lower  bound  for  our  approach.  Say 
that  an  algorithm  on  a  directed  graph  "uses  the  swap  sequence  approach" 
if,  given  a  graph  as  shown  in  Figure  7.4  (a),  it  determines  the  complete 
swaps  found  by  procedure  G  (with  q  =  l).  Note  that  the  algorithm  need  not 
determine  the  incomplete  swaps;  as  usual  the  algorithm  need  not  determine 
the  order  of  the  complete  swaps.  Also,  although  swaps  may  be  for  reduced 
graphs,  they  are  specified  by  the  edges  of  the  original  graph. 

Lemma  7,7.  Any  algorithm  on  a  directed  graph  that  uses  the  swap  sequence 
approach  requires  £2(sort  (n))  time. 

Proof.  As  in  Lemma  4.1  we  show  that  with  only  0(n)  extra  processing,  the 
algorithm,  called  on  an  n  +  2  vertex  graph,  can  sort  n  numbers. 

Consider  n  arbitrary  numbers  x^ .  ...,x  .  Without  loss  of  generality 
assume  that  the  numbers  are  distinct  and  positive.  (To  achieve  the  former 
condition  break  ties  by  using  the  indices  of  the  numbers.)  These  numbers 
correspond  to  the  directed  graph  shown  in  Figure  7.4  (a).  More  precisely, 
the  graph  contains  vertices  v,  w,  and  u.,  i  =  1,  . ..,  n.  For  each  i,  i  =  1, 

. ..,  n,  there  is  a  green  edge  (v,u^)  costing  -x^ ,  and  red  edges  ( w ,  u . ) 
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and  (u  .,w)  both  costing  x. ;  there  is  also  a  green  edge  (v,w)  costing  0. 

Let  the  given  numbers  in  increasing  order  b y  y . ,y  .  Now  we 

1  n 

show  that  the  swaps  found  by  6  are 

(4)  (  (  V  ,  W )  ,  (Upw)),  (  (  V  9  Up  )  ,  (w.j  U  x  )  )  j  ((V5U1),(U2>W)),  .  ..,  ((VjUj)j  (w,Uj))j 

((v,u.),  (u.+1, w)),  ...  ,((v,un_1),  (w,un_1)),  ((v,un_1),  (un,  w)). 

To  do  this  we  prove  inductively  that  for  i  =  1,  ...,n  the  first  2 i -1 
swaps  are  as  in  (4)  and  give  the  graph  shown  in  Figure  7.4  (b).  There 
vertices  Up  ...,  u-  ^  and  w  are  contracted  into  a  new  vertex,  which  we 
call  w;  the  edges  directed  to  this  vertex  have  their  cost  modified  so  that 
(u.,w)  costs  x.  -  x.  for  j  >  i;  the  spanning  tree  consists  of  edges 
(v,u.)»  J  £  i  and  (u. ,w). 

For  the  base  case  i  =  1,  note  that  in  the  original  graph  the  smallest 
swap  at  vertex  w  is  ((v,w),  (u^,w))  costing  Xp  and.  the  smallest  swap  at 
u.j  is  ((v,u  .),  (w,u.))  costing  2  x^ .  Hence  the  first  swap  is  as  in  (4). 

Since  this  is  a  complete  swap  no  contraction  or  cost  modification  occurs. 

This  gives  Figure  7.4  (b)  for  i  =  1,  where  we  take  Xg=  0. 

Now  assume  the  first  2 i - 1  swaps  are  as  in  the  induction  hypothesis. 

We  do  the  inductive  step  by  analyzing  the  next  two  swaps.  For  the  2i^ 

swap,  the  smallest  swap  at  u.  (j>i)  is  ((v,u.),  (w,u.))  costing  2x.,  and 

J  J  j  J 

there  are  no  swaps  at  w.  Hence  the  2it'1  swap  is  ((v,u-),  (w,u.))  ss  in 

(4).  This  incomplete  swap  contracts  vertex  u.  into  w,  changing  the  cost 

of  (u  -,w)  to  x.-x.,  j  >  i ;  the  new  green  edge  (v,w)  gets  cost  -2x. . 

J  0  ' 

For  the  2i  +  1st  swap,  the  smallest  swap  at  w  is  ( ( v,w) , 

(u.,15w))  costing  x.x1  +  x. .  (Note  that  (v,w)  derives  from  the  original  edge 

i+i  i+i  i 

( v,u  . ) ,  so  this  swap  is  also  designated  as  ( (v,u  . ) ,  (u.,19w))).  The 

i  ii+i 

smallest  swap  at  u.  ( j  >  i )  is  still  ( ( v , u  . ) ,  (w,u.))  costing  2  x  • .  Hence 

J  J  J  J 
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the  2  i  +  1st  swap  is  ( ( v , u ^ 5  (u-+^,w)),  as  in  (4).  Since  this  is  a  complete 
swap  it  is  easy  to  see  that  the  rest  of  the  inductive  assertion  holds.  This 
completes  the  induction  and  establishes  (4). 

Now  we  describe  the  procedure.  Given  numbers  x^9  ...»  x  construct 
the  graph  of  Figure  7.4  (a)  and  call  the  algorithm  to  find  the  complete 
swaps.  (These  are  the  odd  swaps  of  (4)).  Then  set  an  array  S  so  that 
S(i)  =  j  if  ((v,u.) ,  ( u . ,w) )  is  a  complete  swap;  further,  S ( 0 )  =  j  where 
((v,w),  (u.,w))  is  a  complete  swap.  (4)  implies  that  the  sequence  S(0), 

S^(0),  ...,Sn(0)  gives  the  indices  of  the  numbers  in  sorted  order.  Since 
besides  the  algorithm  only  0 ( n )  time  is  used,  this  gives  the  desired  result.  □ 

Lemma  7.7  can  be  extended  to  incorporate  a  lower  bound  on  the  time  to 
find  a  minimum  directed  spanning  tree,  as  in  Corollary  4.1. 
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Figure  3.1 

Restricted  swap  sequence. 
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8  1 


(b) 

Figure  3.2. 

Algorithm  A. 

( a )  B-G-^  +  R^ 

(b)  The  two  recursive  calls. 
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Figure  7.1 

Example  directed  graph. 


Figure  7.2 

Optimal  trees  for  Figure  7.1. 
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(c) 

(-2,1)  -  complete 


Figure  7.3. 

Swaps  done  by  procedure  G. 


